Attorney Docket No. 16528A-020500 



This application is a continuation-in-part of V.S 
r serial No. 08/198,431, tiled February 17, 

patent Application Serial mo , p.Kruar/ 17, 1995, 

serial »o. J^ s^ial No- 

Serial NO. 08/«.,es4, file Apri 1 95^ S ^^ ^^^^^^^^ 

„B/.37,874, : /„,,SSS, filed M.rch 

r:9r:ri « =o:;eri:43o, fued Mar. . i^., senai 

PC./.S9./OS480, filed April - --:7S%, filed 

"''"■*r9S8'"s:riarN 08 7 Ls.: fHea septe^er .7, 199e 
July 3, 1996, serial No. / specifications of 

which are herein incorporated Dy re 
for all purposes. 

.ecursiv7;^^;;;;r==^nta Pe— 
. . • . .vcles of recombination and screening or selectio 
"iver;^^^^^^^^ -nes. whole plas.i.s or viruses, 
- tl.ene or ev .h . .en.es^ .^^^^^^^ 

^^^^^^^J^^^^^^^^^ computation retired .y 

require ^^^-^^^^^^^ polypeptide engineering. Recursive 

conventional methods ^^"^ ^ recombination of large 
sequence recombination allows the rec cycles, 
f n,M^ations in a minimum number of seiecci y 
3 numbers of recombination events. 

■^o r.;,rt-icular advantages in that tney h 
techniques provide particular 

— "-r""fr;t^ ore^Lrin. t>.e _ 
^ rrX:::: coIblnaLcns of .utatlons can affect a desired 

result . 



in son,e instances, however, structural and/or 
. „n fs available which, although not 

required for ^^.^^ ^he technique. In other 

opportunitxes of a large number of 

instances, selection and/or screening ^ ^ 

. . costlv or time-consuming. A turcne 

recombinants can be costly ^^.^ 

rrrr aLU. the. iss^a a. 
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S^^SLof the invention 
one aspecrir^hrT;:^^;;^^^;^ for evolvmg 

• ded by a DNA substrate molecule comprising: 

a protein encoded by a DNA ^ ^3^. ^nd second DNA 

(a) digesting at least a f 1 ^ ^^^^^^ 

substrate molecule, wherein the at least 
substrate molecules differ from each other in 
^th a restriction endonuclease; 

™="°"";.ru;ati:: the .ixture to generate a library of 
"-""^"^"^.ncreernro^ selecting the products of (h, for a 
aesirea property, -a^^^ a recombinant OH. substrate molecule 
encoding, an evolvea protein^^ 

^o,^ bv a DNA substrate molecule by 
evolving a --^--^ iLrand second ™. substrate 
recombxning at least fir ^^^^^^ substrate 

molecule, Wherein the "J_-='J ^ nucleotide 

molecules differ fro. °f .o^rising: 

and comprise ^="-^."7";/; .roagonucleotide PCR primers, 
,a, providing a -t of °U9 

con^rising at least one primer / with 

prin«r sequence is complementary to at least 

another -g^nt^ ,He at ^-t a first 

, and second 0» substrate molecules with the primers 

,a) in a polymerase chain , to generate 

(c) assembling the products of step IDl 
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(d) screening or selecting 
.esirea ^^<^^^^^^ , .econ^inant BN. substrate .olecule 

f .o. (d) encoding an -Ived P-^in- ^^^^^^ ^ 

A further aspect of tne sequences 
r^Miation of DNA fragments for muc 
enriching a population o 

,..,»ents to generate ^^J-^^^^.^lranaea ..a^en. 
fragments in ».tach .t laa 

o£ about 20-100 bp, . J ,„„ j„g„ents having a mismatch 

^":rrgene»!e a pool o. D» fragments 

affinity matrix to 
on an ax J- J-^^-^'--' and 

enriched ^trproaucts c£ <c> to generate a. 

(d) assembling f ^ t^c 

^ * Kof-rate molecules, 

library ot reco*in.nt =f-^'f„^^„,,„„ ,3 a method for 
further aspect =£ the ^^^^^^ 

evolving a protein ^ ^/^/....^a D» substrata 

recombining at least a £.rst substrate 
^lecule, "herein the at o£ about 10 to 

-T:;:- ayrrise denned segments, the method 

100 base pairs <aw 

comprising: . „glons o£ homology in the at least a 

,3) P-^°"^^"\';/;^„ molecules by inserting an 
arst and second ^ defined segments, 

intron sequence between at 1 substrate 

.lecuies ------ ^ 

the introns; .^^lecting the products of (b) for a 

(c) screening or selecting 

— -r/eioreting a -----rri: ^^^^ 

- - Trrhefarplcrort:^ I-tiln i! a method £or 
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^ v,v a DNA substrate molecule by 
evolving a P"";";;^ Ifana Teccna ^ s*str«e 
recoTnbining at least a i ^^^^^^ substrate 

molecule, wherein the at leas ^^^^ nucleotide 

1 /qiffpr from each other m 
molecules differ tro comprising: 

.„a con-prise rouUclectiae PCH primers, 

T: oreacfseA a pair o£ primers is 
„.erein for each strand c each 

provided, one ^.iaging tha junction at the 

ena o£ the segment and the ct ^^^^^ ^^^^ „^ 

other ena ot the seg»,ent, with ^^^^^ ^^.^^^^ 

— Trserorpr:r is Pro:iaea .or each o. the .t 

- rs::::h::rrh^t^i^^^^ a .irst 

lb) amplifying the segments ^ , (a) in 

,„a second substrate molecules with the primer 
, polvmerase^chain «action,^ ^^^^^^ , 

Of »^ cf <c, for> 

aesirea P^Pf"/' , ,,,,^i„ant substrate molecule 

- - -rr airrs rnrrirrrth. for 

^ further aspect ^ ^^^^^^^^ 

, re::r:r:rt::nsin:oL a substrate moiecuie, 
— p-^-resittrsHr^^^^^^^^^^^ 

each oligonucleotxde comprises degenerate 
:rirercrargerrr r:-: r^^^^ ^ region Of an ammo 

acia sequence of the P'°'^""' oligonucleotides into a 

(b) assembling the set of olig 

library of full ^^^th genes ■ ^ ^^^^ 

TrrertirpLducts of .C for i^rovea 
" expression of the^protein, and^^^_^^^^ „,,„i. 



encoding an evolved P"«f 'tl;e„tion is . -thod for 

. further aspect ^^^^^J^^^^^^^ ^ , eu.s«ate 
optimizing expression ot a p „^,„i. the DNA substrate 
:.ecuie .V ---^J- - ;r:re:"ator and a fusion Of a 
^lecule „uh a !>» sequence encoding 

™. sequence encoding ^^^^ 

, Uc >'-*-"^/-7„;„,„, , ,„3t ceil With a Uhrarv of 

.utageni.ed DSft substrate ,„c^ed by 

(b) inducing expression of the p 

Che library of (a) ; product of lb) , 

(c) preparing an j„„ complexes 

(d) fractionating insoluble protei 

soluble protei-nd- „„,ecule encoding an 

evolved protein from Id) . i„,e„tion is a method for 

ft further aspect of the ^^^^^^^ ^ ^ 

evolving functional °' , cBA sequence 

substrate -1""^= ..^7"";",^ ,,^ence encoding filamentous 

— f:sr p:otiir:::rd rrrary of 

particles expressing a fusion p 
, «».---;rrrnrf:rr infectious partides 
...paying the fusion pro«i.,__^ P«--- ^-^t" ^ 

T and for the protein; and 
.utant protein using a Ugan ^^^^^,-,3 .^coding an 

(a) -=-^^7^,^ .^"\rrif ied particles of (O • 
evolved protein fro. affinity , method for 

, further aspect o he^ ^^^^^^^ ^ ^^^^^^^^^ 

optimizing expression of a P ^ ^^^^^^^^ ^^^^^.^ 

molecule comprising a fusion dNA substrate 
protein with a lac -^^-^^ _tor, the method 
molecule is present on a first p 



35 

comprising: 



ho=t cell transformed with the first 
(a) providing a ho=t oell ^^^^ 

vector and a second vector ""^^ f ^J/^tf,,, operator, 
least one chaperonin "^/^.^ of (a) ,■ 

frrtiratrn/rnrn.^ Prote. fro. co.le.es 

• . further aspect of the ^/.^ sl^Le 

a nrotein encoded by a ujna 
opti-izlng expression of a P ^^^.^^ 

„„lecule con«>rrsin9 a f"""" i„ the fusion is 

protein with a filamentous phage gene „„i„ 
carried on a phagemid comprising a library 
stents, the "j;' ,,11 producing infectious 

^"i:ra'fus o prt Sin encoded by a library of 

particles expressing a fusion p 

-"-^^i™:e:rrrnrrrr.:.ectiou^ 

'° • = liaand for the protein; and 

p„tein using . U^g-^^^ ^^^^^^^ chaperonin 

£rom affinity purified P-^^f;/^^ j:l,,,„ ,3 a method for 
. further aspect of the n ^^^^ ^ ^^^^^^^ ^ 

cptimi.ing secretion fjJ^J^^^^^,^,^,.. 

..coding Cluster of genes encoding secretory 

functions; . ^ second sequence 

.eco^inin. at ea 

30 in the gene cluster of ^^^/^^^ f.om each 

at least a first and ^^^^^^J^^ ^,,,,,,e a library of 
other in at least one nucleotide. 9 

recombinant sequences; culture with the 

(c) transforming a host cexi 

,b) wherein the host cell comprises a DNA 
35 products of (b) , wnere 

sequence encoding the protein; screening or 

(d) subjecting the product of (O ^ 
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3 s.cr...ry function fro. the P^^-^^^^^^^ , for 

A further aspect of the m 

(a) providing a library ot 

„o.ecuUs enooain. "^^f^^ "ul transfectea »ith <a, 

.creenxng extracts 

3„d co»parin9 activity ^^j^ "/^/^^f ,„,,„.te mol.cuXes fro™ 
„ccver.ng .utant D ^^^^ i^,„,ea 

,eUs in lb, expressing ;«-;^;;^/^,,73na 

activity over «"^-^^ ^^^.^bstrate ^Xecule enccaing an 

(d) recovering a 

evolved polymerase fro. ^ ^^"^.^ ,s a method for 
A further aspect of the ^^^^^ ^^^^ 

n • ^ =, DNA polymerase with an erro 
evolving a DNA po y comprising: 

of wild type ' ,y of mutant DNA substrate 

(a) providing a ^^^"^^ ^^^^^ ^ host cell 
molecules encoding 3 revertible mutation. 

, comprising an indicator Seated by the mutant DNA 

therein the indicator gene is rep 

polymerase.^^ .ereening the products of U) for. revertants of 
indicator gene. ^^^^^ ^^^^^^ ^^.^trate molecules from 

--^""^^;rrecovering a DNA substrate molecule encoding an 

evolved polymerase from jfJ^JJ^Jon' is a method for 
A further aspect of the m 
30 evolving a BK. P°^V^erase co^pr,""^;^^^^^ 
(a) providing a library 
molecules encoding mutant ™. polymerase, 

cot»prising a Pl==»^^^"'°';„,a preparations and extracts of 
preparing plas-a P P ^ 

„ h„.t cells transfected with the pr ^ ^^^.^^ ,„ , pcR 

amplifying ^^'^l^^J^^ ^J,^ ^ that plaB-id, 
. reaction using the mutant polymerase 
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Z .cue... encca.. a„ 

' irieri Phosphcna^se fro. a phosphonatase 

(a) providing library oi mu , . , 

1. the library comprising a plasmid 
substrate molecule, the iiorary 

" expression vecto-^^^^^,^^^ ^ ^^^^^ ,,,, ph„ operon 

is deleted, ^^^^^^^^^ ^^^^^^ „,„,fectants of (b) 

.m. a p-n---°\^„r:rors:.::r:« ^^^^^^^^^^ 

(d) recovering tne ujnm 

evolvin, a protease encoded by a substrate .oleoule 
--^^'"^U, prov.l„. Ubrarv o™t. of tbe^- 

..strata ' rJ^rrfrt-e Leoule Is 

expression vector, wneren 
25 linked to a secretory leader; 

rere::i:r .r^; of t. transfe=ta„ts of . 

- ' -"^Td-Terotrn:" Lstrate .olecule fro. >o, 
30 encoding an evolved proteas ^^^^^^^^^ ^ 

. : Ubrar; o protease .utants displayed on a pha.e 
screening a library J 3 substrate 

to obtain an Lproved protease ^ 

molecule encoding the P^'-^J^^/^/^t fusion protein, 
3S £ila.entous phage protein to generate 

. comprising-. , „.n. exoressing the fusion 

(a) providing host cells expre. 
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protein; a protein net to 

(b) overlaying host cexxo 

entrap the phage. ^^^^^^^ „ „cover phage 

liberated .y digestion ^^^^ ^2"^ prl=t of <=) . and 

rrher aspect o. ^t:^^:^^^^ 

screening a library o£ protease nmtants 

protease, the '^'""^ . „j peptide substrates, the 

P«'l^-^=/_ff ;^;i;p,Le and a fluorescence 
peptide substrate comprieang a fluor p 

quencher; nv,v,rv o£ protease mutants tor 

..i.i. Jlirtrpe^idrarbl^rate:, .herein nuorescence 

is measured; and protease 
(c) recovering DNA encoding at lea 

n,utant from (b) • invention is a method for 

A further aspect of the in 
° , V ^«^^•rferon gene comprising: 

evolving an alpha ^.^ant alpha interferon 

ta) providing a ^^^^^/^ ^^^^^^3 ^^.ge vector; 
library comprising a filamenco f 
genes, the library f comprising a reporter 

(b) stimulating cells compr ^ ^eoorter gene 

^^rict-ruct comprising a reporter y 

rerc:;r:rorri::err:: responsi. propter, and 

' HLx::rtV^ra;d 

.ecovering phage - th P^^^^_^ ^^^^ ^^^^ 

(e) recovering an evoj-vcv* 
30 ^ 

product of (d) . invention is a method for 

. : rr-nLt a ™. substrate encoding a 

rrrtri: an e.ol.ed - 'rn^rrhe Ubrary 

(a) providing a library uj. . 

comprising an expression ^ell with the 

(b) transfecting a mammalian hos 



10 



Ubrarv o£ ,a, . whe«i„ ^u.anC protein is expressed on the 
.urface o£ ^ ^.oaucts of wit. 

, a U,a„. ™ta„t protein .ro. t.e 

' P«^-" °;r::;oferin, an evolved BKA s..strat. fro» the 

products of <dK ^^^^^^.^^ , 

.0 evolvin. aCsuhstraL molecule encoding an interferon 

alpha. ^ ,f „„t=nt alpha interferon 

(a) provxui y ^^nression vector wherein the 

,enes.. the ^^^^^^ . ^^^ZZZsZ^Z the control of an 
alpha interferon genes are expresse 

,5 inducible P-"-;- ^^^^^„^ ,,3, ,,us with the library of U. , 

ccntactm, the product of ^^^^^^^^ 

(d) recovering DNR encoding a nmtant alpha 

v,„=,- cells surviving step (c) ,• and 
interferon from host cells^ interferon gene from the 

(e) recovering an evoxvcv^ 
20 ^ 

product of (d) . invention is a method for 

A further aspect of the mven substrate 
of a orotein encoded by a uin^ 
evolving the stability of a pro ^^^^^^^i^g a fusion of a 

„olecule. the »Y-rh:"pr:rn":tr:T. silence encoding 

" Hirar^sThr^^^^^^^^ — ^ 

method ^-^^^^^^^ ^ „u expressing a library of 

.utants <^Y-J-^-:;;rf"y;ng the mutants with a Ugand for 
" .he proteil wherein Le ligand is a human serum protern, 

the affinity ;re'encoding the protein 

(d) recovering an evoj-vcv^ ^ 

" from the product^of .c.^^^ ^ 
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^ i^s<;t two subunits, comprising: 
evolving . protein havxng at least ^^^^^ ^^^^^^ 

(a) providing a iiDrary 
rrolecules for each subunit, library of 

,W reco^bining the Ubrarxes r 
, eingle chain conat«cts . /^^^^ each 

TZ l:rra ::"cr;;e:ence^nco.ing .h. carbo. 

.0 terminus of a — products of (B) . 

(c) screenxng or ^-^^^^^J ^^^.^ construct 

(d) recovering recon>bxnant^sing ^^^^ 

DNA substrate molecules from to mutagenesis; 

(e) subjecting the products ot 

15 . evolved single chain construct DNA 

(f) recovering an evoivea a 

substrate .nolecule from (e) _ ^ a method for 

R further aspect of the i ^„^^„„e receptor 

evolving the coupling of a — 'l-^J-^/^^^...^,, 
„ 3 yeast Signal "-s.uct.on pathway c^^^^_^ ^ ^^^^^ 

-press^ngaU^ra^^^^^^^ 
protein mutants in a host cell, ^^^^ptor and a 

expresses the mammalian ""'-"^^^.^ed under control 
reporter gene, the receptor gene geing exp 

of a Pheromone -'P^-^/'^^^'J^l^, ,,e products of (a. for 
t=ergre m tl presence of a ligand 

- - an evolved . alpha 

30 — r ::;::if":rr- 

* f-ther aspect 
reccrijining at least a first ana 

...ecule. co---;^^,,„^ , _ .it-t -.^^^^^^^^ 

... .econd ... substrate ^ host 

and second DNA substrate molecules 
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cell; 
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from (b). invention is a ""hod for 

ft further aspect of the 

' evoivin. a --'""-'Crate c^'ses'a vector, the 
interest, wherein the ^^^^^^^^^^^ ^ .etho. co.prisin., 
vector comprising .,i,.,tra„ded vector DNA and a 

(a) providing sing ,v,«rate sequence,- 
, U.„rv of -tants - ^ ^Ild f- the iibrarv 
ly,) annealing smy-^ c . 

vector DNA of (a) ; 
,a, to the =in5l^;"f/;t;;;Laucts of lb) into a host,- 
"ansfor^ingjhe P for a desired 

(d) screening tne px^^ 
" Propartv,- and ^^^^^^^^^^ ^^^^^^^^.^ ,„,3trate ^ f- the 

products of Id) • 

«i-*f^t1ir^t^^^;:r;^go PCR primers 
ngure 1 *=^-" f intestinal aUaline phosphatase, 
'-n:;:: rre^ctftn: alignment Of alpha interferon 
amino acid and nucleic acid sequences. 

DescriEtion_ol_yae_S±! 77777 strategies for 

r.T:^:^^^:^^^=^^^^^ methods, 
evolving polypeptides ''>""f "f^^ invention can 

in so« e^^--";. .f;;"::a!.e grain shuffling" and -fine 
generally be classified as ^^^3, 
30 grain shuffling." « ^-^"^'^ ^^/^ ^i.uations where some 
Lrategies are -^^^^^'^^ ,s available regarding 

,,„ctural or functional informal ^^^^^.^ ^^.^ ^„ 

the polypeptides of screening of many 

manipulated is large, when sele ^^^^^^^^ 

„ .eco^inants is ...Hange or recombination of 
shuffling- generally invd functional 
segments of nucleic acids, wheth 
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e restriction endonuclease fragments, or- 
domains, exons, restrictio ^ 

u^*-r=vilv defined segments. Fine 

variation cccurlng "^^^J ^^^^^ Z..o.,l both 
„,„ed -searching o« J'//^^,,, ^aUtativ.ly 

"Chnl^ues are '^^^^^ll^^^'J;^'^^ ...^hes are otten 
different. For exan,ple, coarse g 

nrr:::^" Irche. are Often 

polyKetide operons, ,3 protein expression 

optimal for optimizing a property 

using codon usage libraries. ^ evolution of gene(s) 

strategies .-ra lly e tax ^^^^ ^ 

- -^r"'^eU : i^o -t Of function In a homolc^ous 
heterologous cell or ^^^^^^^ g„erally by a 

- heterologous ce 1^ ^,,,^i„,,i.n. Recursive 
process termed recursive q different 

.equence "/--f-:;/:; rl:, as described in further 
formats and permutations of principles, 
detail b*lo.. These formats =° 3uccesslve cycles of 

recursive sequence -^^^"""^^"'"/^rity. i.e., the 
„co^lnatio„ to -"-^^^f;" /riec^le^ shoeing 

„eation of a family ^^^Z l each other but differing In 
; substantial sequence '"^'"""^^ recombination cycle Is 

,.e presence 'n/t^eening or selection for 

TLra detlrercb "Cterlstlc. The molecule.s, 

„„,lecules having a desire . materials for 

-'-'T "rerrt; i "Tound^ - - given cycle, 
0 generating diversity ^.^^^ furthermore, 

recombination recombination can be augmented in any 

diversity resulting J mutagenesis (e.g., error- 

hy ^PP^^-/^^f^:°:: ^esls, passage through bacterial 
""trams trellnt with chemical -tagens) to either 
" r fuL r at s -r or products of reco^ination. 
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evolution, or ^ :^rc:-penaing ap,xicaticns 

p«sen. inventors „s/x,8,431, niea February 

serial Ho. OS/«S,SM file .P- 

Os/S3,,S,,^filea „,,,,,,3.., .ilea Marc. - 

jiled NoveirO^r 30, l"^'^" 25, Serial 

25, 1996, serial «o. 08/621,430 fil^ Serial Ho- 

«o. .cT/«s,e/os4So, niea .pr.i 1,1 9^. ^^^^ 

„e/.S0,40O, „ .ue* Septe„a«r27. 

-I Q iQQfi Serial No. uo/z-^J-f # . 
July 3, 199b, bexxa v>^^ 07 1996; Stemmer, 

,„6, and 08/722,660 filed Septe.^r 27, 

270,1510 <"';";"™:\f :;5f^^„,, St...er, 
„,,S, ,. Ste-er S^~^^",„r4,-107Sl (1994, ; S.e™.er, 

^ 370:3SS.391 ^^^^^^.^.iJi^i^g^lS- 

-^^"^^ ::::r::\er. ..e„e.. 1= used 

- " - --ne::aT;re%::a::dTrraTar:^ o. 

; biological function. Genes^can , j„t^rest or 

source., including clonin. ^J2:SZ:L information, and 
synthesizing fro. ^'^JJ^^^^lX.. desired parameters. 
* »ide variety of cell typ ^^^^ 

,0 recipient of -1- gra.-negative and 

,„.a„de .any bac-- c^l ^^^^^^^^ ,„epto..cetes, 
gram-positive, sucn a penlclllium. Bacillus, 

«tlnowceces, corynebacceria, "^^^t^^'j,/ j„inja . Cells 
KsCerichia coli, Pseudon^nas, ^f^'^^^^' 

,5 of interest also ^^te, human, , both 

T- ^ ,-on«5 (e a., mouse, namstex , ^ 
mammalian cells te.g.. include stem 

cell lines and primary cultures. Such 
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Xy^hocytes, Chinese hamster ovaryJHO . .ou« 

™' : "tus orr.;r «ns, SUCH as 

'"'rTce " o"o„, sovran, sugarcane. .o.ac=o, ana 
, maize, rxce, whe , |pe„icllll-, n.sa»», 

arabiaopsis; fish, algae, rung .„„.t- yeasts (Picchla 

;,spargilJus, Podospora, Neurospora) , insects, ye 

.„a aepena c„ a nu^r of ^ ^ 

! ^h. intenaea use of the engineerea host, 

0 factors, aepenaing on the intenaea ^„^^„„„e„„i 
incluaing pathogenicity, substrate range e 

v^^^ r^f kev intermediates I ease uj- y 
.arainess promiscuous transfer of 

^nlpulation, a„a l^^^^^^ „3,„i.n«. K preferred host has 
genetic information to t^ins of 

. vH. ahilitv to replicate vector DMA, expre y 
LS the ability ^ proteins of interest- 

interest, and properly tr r lactobacllll, 

particularly advantageous ■'-"/^/^^fj ,,„aro.ycas 

"---:r Pi::r::Ss, tLider ecus, .ceus, cos 

.0 reus and transformed B ceU lines such as SP./O, 

J558, NS-1 and AG8-653. ^^g^. 
The breeding procedure starts wxtn 
. .hat qenerally show substantial sequence identity 

to each other <^-^-' ^^^^^ other at certain 

25 sequence identity) , but differ mutation, for 

positions. The difference can be ^^^^ ^ ^,,3,, 

. example, substitutions, ';'f;;,,haps 5-20 

different segments differ ^_^o. e.o^ o^ m pe P ^^^^^^^^^ 
For recombination to generate incre^ 

3„ rera:rto re rrierar^^^ 

™- ^""-^ r Ifir reria:: t„o substrates, there 

positions. That is, „^c-it-ions If there are 

- - - -:;r rs^t-e can differ from 

..ree ; ;;:::,i;n, and the second can differ 

35 the -~"^;=;;/"/'Jj^„ent single position. The starting 

can be na ral variants of each other, for 

DNA segments can 
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allelic or sp.cies variants. The segments can also 
reT^nrUeUc .„es .o„in. so„e - 

rtiiricrs ca^ ..o - rce:\rto.- 

other. PC. e^a^ple, one :X:ZsU...io. of a 

Tarrtrxl - can^Uo ^ .epa^ea .V 

„utageBic cassette. .,^g„ents in a nnitagenic 

propagating one (or both o£ the g ^^^^^^ 

Tn these situations, stricT^xy . v -p 

straxn. In these ^ ^^^^^ ,^^,1^ ,f 

PNA segment xs not ^^^^.^.^ the startxng 

related segments. The dxtr .^bgtantially. the same 

— - - o-en the sam n t,^o. ^^^^^^ 

length. However, thxs need n segments can be 

3 segment can he a -^-^--^^/j^/.^Tuch as vectors, or can 
present as part of larger molecules. 

- ^" ^-^rs^lng segments erh^eir to 

- — ; TeTiLXTr—r/Jseg.^^^ S.oh 

° T^r- T s"°;:r.rtL' sti-in: 

mS in9 or 10^^ members. In gener<ax. 
more than 10 . 10 . or x generated include full- 

segments ana the --^^-^ ^sl I^'l -^^""^ 
length ^^^r^ polvaaenylation se^enoe, 

25 sequences, such as a p 13 „ot the case, 

required for expression . "owever ,„3,rted 
. ,He recombinant D» segments -^^;;f^fj33^,„,e3 before 
into a common vector providing the missing 

performing -='^^'''"^'''^Z'''sZer,c. recombination format 
3„ " '^"^ ""'"''^ library of recombinant 

,^l,,ed is an in -vo «o-t, the Ubrary ^^^^^^ 

segments generated already e. 

the cell type in "^rch expression ^^^^^^^ ^^^^^^^ 

substrate specificity is ^ „combinant library 
35 reco^ination is P^^ormed n ^^^^ ,,,,,e 

rrn^sirn^""— ^ of the reco^inant library 
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can be linked to an episome or virus before introduction or 
can be introduced directly. In some embodiments of the 
invention, the library is amplified in a first host, and is 
then recovered from that host and introduced to a second host 
5 more amenable to expression, selection, or screening, or any 
other desirable parameter. The manner in which the library is 
introduced into the cell type depends on the DNA-uptake 
characteristics of the cell type, e.g., having viral 
■ receptors, being capable of conjugation, or being naturally 

10 competent. If the cell type is insusceptible to natural and 
chemical -induced competence, but susceptible to 
electroporation, one would usually employ electroporation. If 
the cell type is insusceptible to electroporation as well, one 
can employ biolistics. The biolistic PDS-1000 Gene Gun 

15 (Biorad, Hercules, CA) uses helium pressure to accelerate 

DNA-coated gold or tungsten microcarriers toward target cells. 
The process is applicable to a wide range of tissues, 
including plants, bacteria, fungi, algae, intact animal 
tissues, tissue culture cells, and animal embryos. One can 

20 employ electronic pulse delivery, which is essentially a mild 
electroporation format for live tissues in animals and 
patients. Zhao, Advanced Drug Deliverv Reviews 17:257-262 
(1995) . Novel methods for making cells competent are described 
in co-pending application U.S. Patent Application Serial No. 

25 08/621,430, filed March 25, 1996. After introduction of the . 
library of recombinant DNA genes, the cells are optionally 
propagated to allow expression of genes to occur. 

A, Jn Vitro Formats 

30 One format for recursive sequence recombination 

utilizes a pool of related sequences. The sequences can be 
DNA or RNA and can be of various lengths depending on the size 
of the gene or DNA fragment to be recombined or reassembled. 
Preferably the sequences are from 50 bp to 100 kb. 

35 The pool of related substrates can be fragmented, 

usually at random, into fragments of from about 5 bp to 5 kb 
or more. Preferably the size of the random fragments is from 



.0 .P to .000 .P, pre.e.a.Xv l^e/l 

.„,.e„ts is a.out a .p ^P;^^^^_ ^^^^ 

be digested by a nuni.er of d^"^" restriction enzyme 
o. R»se digestion, "r,!, ^^^.ents o£ a 

digestion. The concentration of nucleic a ^ ^ ^ 

particular length is often less "'^ /^..^.i, 

, arid The number of ditrerenu ^ 

the total nucleic acxd. T ^^^^ ^^^^^ 

nucleic acid fragments m the mixture 

about 100, 500 or 1000. ° fragments are 

The mixed POP^-^^" ^ ^ "l oo preferably 
denatured by heating to ^ ^J^^^^^^ -leic acid 
r Is' "si le St anded nucleic acid fragments having 
fragments. Single ^^her singl^"^"^''^^^ 

regions of sequence sealed by cooling to 

nucleic acid fragments can '^^^^ c. 

o . nz,o c and preferably from 40 c to e 
20° C to 75° C. ana p the addition of 

Renaturation can be accelera concentration 

-I -.^T /"PPG'M or salt. J^"^ ^'^•^ 
polyethylene glycol ( peg ) or ^ preferably the salt 
L preferably fro. 0 * to .00 ^ .ore pr fe y ^ ^^^^ 

concentration is fro. 10 "^^ __„ation of PE« is 

.alts as «,,.S0 -t^J^^rably £ro..S* to 10.. 

preferably fro. 0, to 20. .or p substrates, 

— rarerulracia .rag^nts ar. incuba- in 

, presence of a -"/^ H-'e^ -P I^.^: 

— r l^rio;; r■se-ridentity are large, .a, or 
:rhe: hi;h.te.perature Polymerase can be use wr h 
annealing te.perature -^^^^^^^^^^ 

'0 identity are "J^^^^ 3„ ,„„,3Ung te.peratur. of 

polymerases can be used wit 

between 20.30-C. The ^'^"H' ^^Mn,. simultaneously with 
nucleic acid £ra9.ents prior to annealing, 

annealing or after annealing. renaturation and 

The cycle of denaturation, presence of 

, ^lo-i/- acid fragments xn cne pi^^* 
incubation of rando. -shuffling- of the 

polymerase is someti.es referred to 
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„u™ber of ti.e.. Preferably the =VC>.e " -Pe' ^„ 
100 ti»es, ™ore preferably the " ^ ,,^1^- 

,0 tl.es. The resulting nucleic acrd a e , family 

preferably fr« 50 P ^^^^^ ^^^.^^ .^stantial 

warrants °' ,13, aivergin. at several 
.e^ence xaentrty ther ^^^^ ^^^^ „,*,rs than the 

posrtrons The pop „ j„g„ents resulting 

starting substrates ^ P P ,„pufled by PCR, then 

from recombinatron rs P"^-'"^^^^ ^ ^^^^^ ^«ure 

cloned into an appropriate vector and 

used to shuffUng, subse<rience» 

" - Z be generated by a^l«Ying the 

„f reco^lnation -""""^ ""aitions which produce a 
£ull-length sequences under condrt ^ ^^^^ 

----'tX'^'^^-^- d d 

:urf:rr^^^^^^^^^^ nrurr arrbrera trriLt , 

-^":ri:ralTyre"ofTearaU:g^n^ amplification. ^«.ls 
rrirr^ereln at ^ ^ ^ 

amplification provides a .er ng " m the subsequent 

extended products, is termed p,,aucts anneal 

. a^Ufication round '--^J-X /e" Le-related template 

to and prime extension on different: 

species. least one cycle of 

in a further variation, at least one y 

amplification can be ccn-^^^^^^^^^^^^ sequence, 

,0 overlapping =7^=-"""'//J,,agment can hybridize to and 

and different lengths. Each "9 fragment from 

prime polynucleotide chain /^/^ „ed 

..e collection, thus ^^^^^^.^^ single-stranded 
polynucleotides^ in a single 

35 iragments of variable leng ^ .^^^ 

primer by vent DTO polymerase on a f 1 _^ ^ 

single stranded Dm fragments are used as prime 
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containing circular sing ^ ^^^^ .^^^ ^^^^^^ 

(see Levichkm et al . , B ^ recotnbined by 

Nucleic acid sequences can be sequence 
• . ..auence recombination even if they lack seq 

=^ — can -.~;;;-r:r.ec..c 

synthesized to contain - addmo . ^^^^ 

S. to the gene (sequence M and a d. 

. 1. ro the segment (sequence B) . mi • J 
sequence 3 to the 9 ^^^^^^^ 

ie amplified us.ng a 5 pr»^ , , 3, 

complementary ="-\°*/;j„r,„.,o base sequence (C, . 
primer contarnrng ^ ^aj,„„t segment contain 

Similarly, primers for the "^^ ^ j„ ,^^3 ^^y, small 

sequences C '""^l'""""^^ ' ,3, making the segments into 

„gions of homology are subsequent to the 

site-specific recombination ca set e. q^^^ 

initial amplificatron of -^^jf „ primerless PCR. 

segments can then be "^^^'J^;"-^^^,,,, Le shuffled, it 

domarns ^^^^^^^^^^^ sequences 
„sy not be possrble to rn ^,„„ining a 

. the domains, '° "^^^ ^^.^.a, groups of 

continuous open reading frame. homologous to the 3' 

oligonucleotides are synthesized ^^^^ 

to be 

end Of the first ^"-^"/^f ^ ^^^nd L-i- encoded by all 
shuffled, and the S- ^^^/'/^f^/together. This is repeated 
,0 of the other genes to be ^''''lf^\'°'s that allow 

„ith all domains, thus P""^;"^ Jj;„^ii, maintaining their 
recombination between protein domains 
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^^mr^v-i qina variant forms of a 
collection of polynucleotides co p 

c=iiV^c=tantial sequence 
gene. The variant £c™= .suaUy ^ow s 

iaentitv to each other su Ucr nt to Jl^^^^^^^^ ^^^^^^^ 

^combination between substrates^ ^ ^. 

P-vnucleotiaes - ""^ j r^onl PC .r error-prone 

variants), induced (e.g., err h ^^^^^^ ^.^^^ 

recursive sequence recombinataon) .esynthesizing 

n-ivprsitv can also resuxu 
recombination. Diversity alternative codon usage. 

,e„es. rreLrrrci"^^^^^ — 

rrtrrs^rt tL^ination .ene at „ore .verse ^ 

^ C5t-artinq materials- j-*!^ 

proauots than there { positions, 

least two substrates differing ^^^^^^^^^ ^^j.^oB ^^^rs 
However, commonly a library ° ^ ^he length o£ 

is employed. The degree of ^^^"^''^ 

the substrata being "-'^^''"''/"''^^^^ity at between 0.1-25% 
-notional change to be --'d^ J--^^^^^^^^ 

o£ positions is tyPi"l- _ piasmids are often standard 

incorporated into P^^-^^^J^^ ^„^,,,,„p, piasmids. However. 

' Tefhrds' to described below, the pUsmids include 

in some methods to substrates can be 

mobilization (MOB) functions. The 

incorporated into ^ ^ Ifd Ta-n. aiffarent types of 
least two different types of P ^^^^ ^^i^^tlon for cells 
t selectable markers are u== 

SO I where 

containing at least two types °' ^tZ' the different 
different types of plasmid ^.^^^o^^.^^in^ groups to 
piasmids can come from two ^ „ias within the 

tuow stable co-existence "^-^^^^fj:: J incompatibility 
3„ ..,1. ^-^^^'Z:^ same cell for sufficient 
firtrauow — ous reco^i.at.. ^^^^ 
Piasmids containing diverse su . . ■ 

Piasmius „^^v,od (e q.r chemical 

J i^frs rells by any metnoa ^e.y.r 
introduced into ceiis electroporation, 

3, „ansformation, ""^^ Jj;; viral systems, . often, 

hiolistics, P-i^'^-^ J;";°f;;:,„ saturating concentration 
Che piasmids are present at or nea 
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(with respect to Tnaximum transf ection capacity) to increase 
the probability of more than one plasmid entering the same . 
cell. The plasmids containing the various substrates can be 
transf ected simultaneously or in multiple rounds. For 
5 example, in the latter approach cells can be transf ected with 
a first aliquot of plasmid, transf ectants selected and 
propagated, and then infected with a second aliquot of 
plasmid. 

Having introduced the p.lasmids into cells, 

10 recombination between substrates to generate recombinant genes 
occurs within cells containing multiple different plasmids 
merely by propagating the cells. However, cells that receive 
only one plasmid are unable to participate in recombination 
and the potential contribution of substrates on such plasmids 

15 to evolution is not fully exploited (although these plasmids 
may contribute to some extent if they are progagated in 
mutator cells) . The rate of evolution can be increased by 
allowing all substrates to participate in recombination. Such 
can be achieved by subjecting transf ected cells to- 

20 electroporation. The conditions for elect roporat ion are the 
same as those conventionally used for introducing exogenous 
DNA into cells (e.g., 1,000-2,500 volts, 400 /iF and a 1-2 mM 
gap). Under these conditions, plasmids are exchanged between 
cells allowing all substrates to participate in recombination. 

25 In addition the products of recombination can undergo further 
rounds of recombination with each other or with the original 
substrate. The rate of evolution can also be increased by use 
of conjugative transfer. To exploit conjugative transfer, 
substrates, can be cloned into plasmids having MOB genes, and 

30 . tra genes are also provided in cis or in trans to the MOB 

genes. The effect of conjugative transfer is very similar to 
electroporation in that it allows plasmids to move between 
cells and allows recombination between any substrate and the 
products of previous recombination to occur, merely by 

35 propagating the culture. The rate of evolution can also be 
increased by fusing cells to induce exchange of plasmids or 
chromosomes. Fusion can be induced by chemical agents, such 
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as PEG, or viral proteins, such as influenza virus 
hemagglutinin, HSV-1 gB and gD. The rate of evolution can 
also be increased by use of mutator host cells (e.g., Mut L, 
S, D, T, H in bacteria and Ataxia telangiectasia human cell 
5 lines) . 

The time for which cells are propagated and 
recombination is allowed to occur, of course, varies with the 
cell type but is generally not critical, because even a small 
degree of recombination can substantially increase diversity 

10 relative to the starting materials. Cells bearing plasmids 
containing recombined genes are subject to screening or 
selection for a desired function. For example, if the. 
substrate being evolved contains a drug resistance, gene, one 
would select for drug resistance. Cells surviving screening 

15 or selection can be subjected to one or more rounds of 
screening/selection followed by recombination or can be 
subjected directly to an additional round of recombination, 
"Screening" as used herein is intended to include "selection" 
as a type of screen. 

20 The next round of recombination can be achieved by 

several different formats independently of the previous round. 
For example, a further round of recombination can be effected 
simply by resuming the elect roporat ion or conjugation-mediated 
intercellular transfer of plasmids described above, 

25 Alternatively, a fresh substrate or substrates, the same or 
different from previous substrates, can be transfected into 
cells surviving selection/screening. Optionally, the. new 
substrates are included in plasmid vectors bearing a different 
selective marker and/or from a different incompatibility group 

30 than the original plasmids. As a further alternative, cells 
surviving selection/screening can be subdivided into two 
subpopulations, and plasmid DNA from one subpopulation 
transfected into the other, where the substrates from the 
plasmids from the two subpopulations undergo a further round 

35 of recombination. In either of the latter two options, the 
rate of evolution can be increased by employing DNA 
extraction, electroporation, conjugation or mutator cells, as 
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Tn a Still further variation, DNA from cells 
described above. In a ^ extracted and subjected 

surviving screening/selection can be extrac 
surviviny ^pnuence recombination. 

- ^-rrrterL. ^^^^^^^^^^^ 

.„„aitio„s increasea — ^ J/J 

rounds c£ reco,nbinstion and select! / ^^^^^ 

,„.o.ed usin. -j;:/rol"n ion selection/ 

::^,„.d substrates evolve towaW 

• *! r^ilred phenotvpe. Typically, in this and 

acquisuacn o£ a deeded P ^.^.^ ^ o£ 

other methods °£ "-'^^ ,„ired phenotype differs 

recoinbination that has acqu positions and has 

starting suhstrate - J^^^^^^ ,^ ^ , 

..olved at a "te rae 9^^^^ ^^^^^^ 

least "° ; „,t„rally acjuired notation of 

rate of -"^""^ f/^^^'^^^^.i^^, per generation (see 
ahout 1 ^utat- ^^^J^^^^^^^^^^^^J^. S3...0S-907 

jmderson et al., Erfia, — k. transferred to another 

,1„,„. The -final product- may ""f f ^^j,^,„ 

,ost ^re desirahle for ^^^'^^^^'^^l^^ „,ere the .ore 
This is particularly '^^""^^^/^/^'J^j^r the .any cycles 
desirable host is less ^^^oular 
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. L. ,^::::^^;;ri;^in;^as.id recon^lnatron 

The. reco»*ination,- usually, 

can also be used for virus p ,„„e additional 

pbage.plas.id reco^rna ron^ Howe e , are appropriate, 
comments particular to the ^oned into both 

..e initial substrates for „,,,„i „bich 

r ::t:"rrstrrn:::ted i^^^^^^^^^^^^^^ 

^teTsrrrhe p^d. . before, 
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^ ^„r^•ir■allv contains , a selective 

carter. The plasmd and J^^/ „„„,,er, a 

into cells by transtection as ^ ^..^ 

„..e efficient procedure is to transfect the ^ ^^^^^ 

-. f i--ranc:fectants and infect tne uxaii 
plasmid, select "^^^^"'"^^.^.^^^^ ,f infection of many 
with virus. Because the transfected and 

^- ^^ nnns- nf cellS, mOSt ceiXS uj-exii^ 

viruses approaches 100* of ceil , 

infected by this rout, contain both a plasmid 

..„in. ^^""-rrrornation occurs between plas^d and, 

,i„s .eneratm/both — ^ r»hich 

-"T. ::r;hr::rirnrore"that rapidiyuus ceus, 

provided that the viru ^ electroporation or 

recombination can be ^"^'"^"'^^^/Y^w.en cells. Recombination 
^vansfer plasraids between ceiis. 

conjugation to „j ,i„s by allowing the 

can also be augmented for so„e yp ^^^^^ ^^^^^^ 

progeny virus fro» °- 3,„„ ,,,i.tance to 

types of virus, virus infected - ^ ^^^^^ ^ 

superinfection. However, such '"-'^^ ^^^^^ 3„,i„3 of 
infecting at high multiplicity ^^J^^ „ „^„,ed. 

t.e Virus in -:f,:r.trngX^^^^^^^^^^^ cells 

.ne result °* so.e viruses, 

. „ith virus depends on the nature ^^^^ 

,uch as filamentous phage, stably exist ^^^^ 

ceU and also -^/--rngrcLr ^^^^^^^^^ "-y exist 
Viruses, such as lambda having progeny virions. 

,n , cell li.e :^T:,^Z^^ ulda. undergo 

Other viruses, such as the P 9 ^.^^ ^^t 

recombination with the plasmid ^^^^ ^^^^^ 

cell and ^-"°V pUsm d ™- /^^^^^^^^^ ,eco*inant 
„ithout WlXing the host^ "/^/^^.a/selected using the same 
pusmids , rJco-ination. Progeny viru, 

approach as for vl^sm^oj ^^^creening can also be 

m subse^ent rounds of 
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rr,r- viruses ttot kill their host cells, 
reco^lnat.on. J^'^JJ"^ „ccn*inatio„ reside only « 

reco*inant genes "-"^^ ,,ie=tive assay 

the progeny v.ru ^ I '^o^.^'r,. genes in a cell, the 
requires ^''P"^""" °* "~ „a„,£erred from the progeny virus 
recc^inant genes should ^= ""J^^^^^^^^, retransf ected 
to another vector, e.g.. a p performed. 
,„to cells before -^^^ " ^products of recordation 

'":nel ur^ ;i g rLo^ination and in phage 
,re present .n both eel su ^^^^^^ ^.o^lnant 

extruded from these cells _ T ^^^^^.^.^ 

products provides .^f;;;°"^^J^,,^ie, can be 
plasmld-plasmid ""/^-^^^ , „»nd of in vitro 

isolated from ^^'^H^^^^^^ the progeny phage can be used 

recombination. ^'"^^'^^J a previous round of 
5. to transtect or ^^^.^ transfected with fresh 

screening/selection, or fresh ce 

substrates for recombination. 

, n-vtrns '-^'nation 

The prrncipx virus-virus 

■I Tfiral recombination can apt' n 
plasmid-viral re „„eif ications. The initial 

recombination »ith a Eew m ^iral vector. 

v,„v,tes for recombination are cloned inco 

substrates tor substrates, 

usually, the same vector is use ^ ^^^^^^ 

preferably, the ^ ' ^^^^^ i,,ertion, some viral 

.utation, does not kill cel^ ^ packaging cell 

genomes can be packaged ^" ,,us at high 

Une. The Pa-.ad^--- "Z:^^, ^,,,,,,uty that a cell 
^Itiplicity ^^^J^;2.s.s bearing different substrates 
30 will receive multiple infection, subsequent 

1""^ fhl Tatu e o infection as discussed in the 
steps depend on -^-^^'^ vi„ses have phagemid 

previous section. ^^^^^ p.ess, i.S.. genomes 
(Sambrook et al . , l^a-— - phagemids, the 

35 such as lambda cosmids ^^11 and undergo 

phagemids behave as plasmi s wi Recombination 
recombination simply by propagating the 
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■ particularly efficient between single-«ra„d«i for™, of 

IS particuiarxy augmented by 

intracellular DNA. Recombination can be g 

electroporation of . , ^^^^i^g, cosmids containing 

Following selection/screening, 

\n he recovered from surviving cells, e.g.. 
recombinant genes can be reco ^ ^^^^ ^^^^^ extraction 

r rrarraVocI-^nrc-ea .V «pac.a.i. cos.ia 
™* ""xf 'the viruses are filamentous phage, recombination 

..replicairU occurs ^^'^^^^^^ 
,„.ctea ceUs sex-icn.^^^^^^^^^^^^^^ 

cells infectious particle. (I.e.. 

in^roved properties, '"^eth r ^uchcells. 

phage or packaged P'-'^-f/' Jf^'tL san« as for plasmid- 
subsequent options are essentially tn 

viral recombination. 

4 ^^>.^o«nme perombination 

^TT^d to especially evolve 

This format can be used to esp ^ref erred 

, „hc:trates The format is particularly preterr 
chromosomal -^^"^J^/- ,,,omosomal genes contribute to a. 
in situations m ^^^f ^^l ideation of the 

phenotype or one -e not Know ^^^^^^^ ^^^^^^^^^^ 

chromosomal gene(s) to be sector. If the 

reco^ination are ^--a -to a Pla^.a - ^^^^^^^^ 
chromosomal gene s) to be^vol ^ ^^^^ ^^^^^^ 

constitute a family of sequenc chromosomal 
.eguence identity but -- --- ^e/have not be.n 

;\ri:rt::r:rstrt:: Usually constitute a ubrary 

located, the. "^"^^ ^ ^^^^ „,^er show sequence 

,0 of DNA segments of which y ^^^Ived. Divergence 

identity to the gene or gene(s) to ^ ,3, 
between plasmid-bome s-^-^" ,,,, piasmid- 

rrn:%rt:::erfrrrirf:r:nt Secies th. that of t. 

- ''"Th! ^tsrsTerrlng substrates for recombination 

, are transf acted into cells having chromosomal gene.s, 
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■> .Ann can occur simply by propagating the 
evolved. Evolutxon - ..^nsferring plas.ids 

culture, and can be acceler ^ion. Evolutxon 

between cells con.-^^^^^^^^^^ J^^^^ ,,,, ,,,,3 or by 

can be further accelerated by ^^^^^^^ 
seeding a culture of non.utator ^^^^ ^^,,3,,, 

...ator host cells and preferably, 
plas.ids by electroporat on or cono g ^ ^^^^^^^^ 

— host -/^^ -f^/;, rA^Ilation of a pure culture of 
selectable marker to faciin: selection/screening 

"^""eur^etrrnnrrso^e^ a„./or p.as.i.a that Have 
identities cellB bearing taction. 

evolved toward -^^-"""/^^/."^ina.ion and 

snbse^ent rounds r ^^^^^^ ^^^^ 

..lectlon/soreemng procee^ „,o^i„,tlon. For . example . 
described for P^^^'^-^^^^^^e^ed by propagating cells 
further recombination can be _ electroporation or 

surviving reco.a>ination in plasmids 

.oniugative ^'^^"^l- ^^^^^^tr r^co^ination can be 
bearing additional subatratM f ^^^y, such 

introduced into the aurviv^g ,nd bear a 

plaamaa are f ro» a plasmids to allow 

different selective marker different 
.election for /-^^^^^^^^^^^^ and/or 

plasmids. AS a ^^^^^^^ , subpopulation of 
5 chromosomal DNA can be xsolat .ubpopulation. 

— Lri«: rcrrinro a Plas.id vector before 



transfection. 
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AS xn the other m ^^^^^^ ^ 

" usually one -at ^^^^ ...stantially the same as 

phage or phagemxd. The p substrates for 

for plasmid-chromosome ^^^^^f^'^^'^ctor . Vectors including 
.eco^ination are ^^^^^Z: cells or in vit.o 
the substrates can then infection. Viral 

packaged and introduced xnto cells by 



29 



10 



•^h host chromosomes merely by propagating 
genomes recombine wxth host ^^^^^^^^ 
a culture. Evolution can be 7^,^^,,,p,,ation, or 

intercellular transfer of viral genomes f;;^^^^^^,^,,,^^ 
reinfection of cells by progeny ---^ J^f/^^ ^^^s that 
identifies cells having ^^-^^^/^f/^f J^.„c!ion. 
have evolved toward acquisition ox „„v,,eauent 

rounds of 

There are several options tor suts q ^^^^ 
Por exawle, viral genomes oan be trans£crr 
„con*ination. !"7^^,;^,i,„/,eoo*ination by , 

between -l^B -rvrv n^^^-^ /^^^^^^ 
electroporation. Aitern „„oled and used to 

surviving selection/screening can ^ 

Alternatively, 

superinfect the cells at h.gh ™";f^;=;.",„„^,ced into the 
fresh substrates for reccmbinatron can b. in 
cells, either on plasmid or viral vectors. 

^'^'^''^^^^^^^'^^..cr^i^^s described above, some 

in addition to the teem q techniques ' 

.aditionallv advantageous "O^^-^J/^/^^ f;/, These 

. 1 fn e erain' and -coarse grain- 

„«thods are referred ^^ J one to exchange 

Shuffling. 3ubstrate nucleic acids, 

chunks of genetic material be ^^^^^^^^ ,,,o^i„,„ts to 

thereby "^""^ domains, restriction fragments, 

exchanges ^ '^^';';;::tations. or other arbitrarily • . 
ougo-encoded "-"^^ ^ introducing diversity more 

defined =^9"'""' " . .^^e In contrast to coarse grain 
randomly across ''■V^*^^' ^.^hods allow the generation 

shuffling, fine grain shuffling ^ given 

30 XtirerrtitionrTncluding multiple 

set of very closely linked m 

permutations, within a single g,,i„ 

„ .ome — dTs — -robes 

shuffling techniques are not pe sequence. 
3S of all possible mutations -^^^ = ^/'^^ 3 sampling of 

Kather, these -^f^'^^^J^^^XL on known sequence or 
variation possible within a gene 
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ThP size of the sample is typically 
structural information, ^^^^^"^//^^^ .^.lection process, 
determined by the nature of the screen 

e,ampxe, «.en 3 soreen U pe-^^ ^^^^ 
.icrotiter forma .t aay be^ref ^^^^ ^^^^^^ 

the recombinant library to. 
for convenience in screening. 

. ^, 2s^^f.Kestricti^^ 

I„ seme situations it is advantageous to use 

.estriction en.yme sites ^ -f^^l^^:::, 
.scombination Of in the 

interest, ^h^' ^ „,diiy be shuffled by 

evolution of fragments tna ™-ated Dl» or other 

existing methods due to the presence °« 

problematic primary sequence motrfs . They are a J ^ 
Lr Shuffling large fragments '^VP-ally jeat"^ ^^ 
such as gene clusters that ""^^f^/^tl hough fragments up 
.pCR-anplified- because °' JZ^,,Ue6 by PCR (Barnes, 
to 50 Kb have , ..0 'l.^,,., it can 

H^tl ft'?ad W S-^- ^ " alternative 

be problematic for fragments over 10 kb and 

methods for shuffling ^^^^^ .^"elnucleases used 

TrhrcussT/re ^sa^r^^^^ 
= r; and s these, p---;;^ --irLrnr 
:r"e::i:s :rne:rnXardr::irnds'that aiio« 
reffi::::: ord^ed — j^sr rtdenri::";' 

restriction enz^e (or ^^'f^.^i^es (Sa,*roolc 

conventional restriction „,,,sis of 

«. =n MolecularCloning. CSH Press, ±yo , . j 
et al . , woipcuxoi. ._ introduction of 

rerird^rtrt^rsHLrtoTLreirarid se.ence by 

3. t^rsis (i.e. by --oration Of sil nt mut.^^^^^^^^ 
The DNA substrate molecules to Joe a g 
either be from in vivo replicated . such as a plas«d 



31 



preparation, or fro. PCR amplified nucleic acid fragments 
harboring the restriction enzyme recognition sites of 
interest, preferably near the ends o£ .the fragment. 

ly. at least two variants of a gene of interest, each 
hfv ng l or more mutations, are digested with at least one 
restriction enzyme determined to cut within the 
sequence of interest. The restriction fragments are then 
ioLd with DNA ligase to generate full length genes havrng 
shamed regions. The nu*er of regions shuffled wxll depend 
:„ the number of cuts within the nucleic acid f 
interest. The shuffled molecules can be introduced rnto cells 
« "escribed above and screened or selected for a desrred 
property, nucleic acid can then be isolated from pools 
Uibrarles) or clones having desired properties and =ub:-«^ 
to the saml procedure until a desired degree of improvement rs 

,n son« eoO^diments. at least one DSA substrate 
^lecule or fragment thereof is isolated and subiecte. to 
.mtaaenesis m some embodiments, the pool or library of 

restriction fragments are subjected to mutagenesis 
before the digestion-ligation process is repeated, 
before '"e aig comprises such techniques known 

"Mutagenesis- as usea nereiu v , 
in the art as PCR mutagenesis, oligonucleotide-directed 
lag-Sis, Site-directed mutagenesis, etc., and recursive 
Tequence reco^ination by any of the techniques described 

herein, . j« nn t-he 

An example of the use of this format is in the 

manipulation of polyketide clusters. Poly.etide clusters 
^Khosla et al., IISIEfl! ». September 1596) are typically 10 
: \b in length, specifying multiple large P-VP-^ides 
which asse^le into very large multienzy^e complexes Due to 
1 modular nature of these complexes and '•'^ 
of the blosynthetic pathway, nucleic acids -^"'^^"S/'^"""^^^ 
IS s can be exchanged between different polyXetide clusters 
. to generate novel and functional chimeric polyketides . The 
introduction of rare restriction endonuclease ="es .u h as 
Sfil (eight base recognition, nonpalindromic overhangs! , at 
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nonessential sites between polypeptides « i"^';™ ^^^^ 
engineered within polypeptides would provide -handles, with, 
engineeicu n,,rleic acid segments using 

which to manipulate exchange of nucleic acia 

the technique described above. 

• further technique for recursively reco^binxng 

= nucleic acid sequence utilizes "reassembly 
mutations in a nucleic q ^^^^^^ ^^^^^^^ 

PCR. . This -'hod c- be us ^ ^^^^ ^^^^^^ ^^^^^^^ 

that have been -P-^^^^; ,^,,„i^e is pertormed 

acid ""Plf^^/";,:;„ ;^rs mutants is >a>own from previous 
:::: : L: Id^tlfiea by screening mutants that^may have 

worK or n ^,„«„^ois technique known m the art, 

w«=r, r-vpated bv any mutagenesis cecimj.^" 

been created by y ^^^^e mutagenesis, doped oligo 

such as PGR mutagenesis, cassette mu y ^ . dNA 

mutagenesis, -mical — ^^^^^ Crrrdffirnr 
segments of a nucleic ac ^ ^.^^^^ 

in i""---/-;:-; ™' re era:iy, oU.onucleotide 

hr» have mutations or interest. . . 

Trirs ,oli.os, are synthesized for .C. , 

3e.ments of t^^^^ ^^^^^^ - 

the sequences of the oligon .vnicallv about 10 to 100 

ThP overlap region is typicaiiy <iu^^ 

two segments, ine oveixat^ a ,™„i-5f-i(ad with 

.1 „^v, Each of the segments is amplified witn 

nucleotides in length. Eacn or a 

3 set of such primers, as those 

"Trnt'L::!! rab^:erreile randomly fragmented 
used in secticns ^^^^^^^^ ^^^^^^ 

genes. In brief, in „,.„rs by, for example, gel 

0 first purified away from the primers, by, purified 
- .i= or size exclusion chromatography. Purifiea 
electrophoresis or size exci cycles 
products are mixed together and subjected to ^ ^ / 

'of denaturing, reannealing. and - 'a^'p s an^ 

polymerase and 

— subsequent PC with P-'^^-^-red 

gene are used to amplify the yield of the fully 
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and shuffled genes. This method is necessarily "coarse grain" 
and hence only recombines mutations in a blockwise fashion, an 
advantage for some searches such as when recombining allelic- 
variants of multiple genes within an operon. 
5 In some embodiments, the resulting reassembled genes 

are subjected to mutagenesis before the process is repeated. 

In some embodiments, oligonucleotides that 
incorporate uracil into the primers are used for PGR 
amplification. Typically uracil is incorporated at one site 

10 in the oligonucleotide. The products are treated with uracil 
glycosylase, thereby generating a single-stranded overhang, 
and are reassembled in an ordered fashion by a method such as 
disclosed by Rashtchian ( Current Biology . 6:30-36 (1995)). 
In a further embodiment, the PGR primers for 

15 amplification of segments of the nucleic acid sequence of 
interest are used to introduce variation into the gene of 
interest as follows. Mutations at sites of interest in a 
nucleic acid sequence are identified by screening or 
selection, by sequencing homologues of the nucleic acid 

20 sequence, and so on. Oligonucleotide PGR primers are then 
synthesized which encode wild type or mutant information at 
sites of interest. These primers are then used in PGR 
mutagenesis to generate libraries of full length genes 
encoding permutations of wild type. and mutant information, at 

25 the designated positions. This technique is typically 

advantagous in cases where the screening or selection process 
is expensive, cumbersome, or impractical relative to the cost 
of sequencing the genes of mutants of interest and 
synthesizing mutagenic oligonucleotides. 

30 An example of this method is the evolution of an 

improved Taq polymerase, as described in detail below. Mutant 
proteins resulting from application of the method are 
identified and assayed in a sequencing reaction .to identify, 
mutants with improved. sequencing properties. This is 

35 typically done in a high throughput format (see, for example. 
Broach et al. Nature 384 (Supp) : 14-16 (1996)) to yield, after 
screening, a small number, e.g., about 2 to 100, of candidate 
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recombinants for further evaluation. The mutant- genes can 
then be sequenced to provide information regarding the 
location of the mutation. The corresponding mutagenic 
oligonucleotide primers can be synthesized from this 
5 information, and used in a reassembly reaction as described 
above to efficiently generate a library with an average of 
many mutations per gene. Thus, multiple rounds of this 
protocol allows the efficient search for improved variants of 
the Taq polymerase. 

10 

C. Enrichment for Mutant Sequence Information 

In some embodiments of the invention, recombination 
reactions, such as those discussed above, are enriched for 
mutant sequences so that the multiple mutant spectrum, i.e. 

15 possible combinations of mutations, is more efficiently 

sampled. The rationale for. this is as follows. Assume that a 
number, n, of mutant clones with improved activity is 
obtained, wherein each clone has a single point mutation at a 
different position in the nucleic acid sequence. If this 

20 population of mutant clones with an average of one mutation of 
interest per nucleic acid sequence is then put into a 
recombination reaction, the resulting population will still 
have an average of one mutation of interest per nucleic acid 
sequence as defined by a Poisson distribution, leaving the 

25 multiple mutation spectrum relatively unpopulated. 

The amount of screening required, to identify 
recombinants having two or more mutations can be dramatically 
reduced by the following technique. The nucleic acid 
sequences of interest are obtained from a pool of mutant 

30 clones and prepared as fragments, typically by digestion with 
a restriction endonuclease , sonication, or by PGR 
amplification. The fragments are denatured, then allowed to 
reanneal, thereby generating mismatched hybrids where one 
strand of a mutant has hybridized with a complementary strand 

35 from a different mutant or wild-type clone. The reannealed 
products are then fragmented into fragments of about 20 - 100 
bp, for example, by the use of DNAsel. This fragmentation 
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taction h.s the effect of segregating regions of the template 
containing ™is.atches (.utant infor^tion. fro™ those encoding 
wild type sequence. The mismatched hybrids can then be ^ 
a Minify purified using aptamers, dyes, or other agents which 
Mnd to mismatched DHA. A preferred embodiment is; the use of 
muts protein affinity matrix (Wagner et al., Hu£lsi£3£^ 
Hes. 23(19) =3944-3948 (1«S, ; Su et al . , ^r.r N.rl . Acad^ 

m.a.A > . s^-.^OSlSOSUl^eO) with a preferred step of 
ampli ying he affinity-purified material in vitro prior to an 
Zll reaction. This amplified material is then put into a 
assembly PCR reaction as decribed above. Optionally thxs 
material can be titrated against the 

,e g from about 100% to 10% of the muts enriched pool) to 
L;!r;i the average nu^er of mutations per clone in the next 

round of recombination. ^ ^ . • -u^ 

Another application of this method is in, the 

^ i-haf are enriched for polymorphic 

assembly of gene constructs that are enri^i c t- 

eroccurrlng as natural or selected ^"elic variants or a 
differences between homologous genes of related ^P--^ 
example, one may have several varieties of a plant that are 
believed to have heritable variation ma trait of interest 
(eg., drought resistance). It then is of interest to 
construct a library of these variant genes containing many 
mutations per gene. «utS selection can be applied in 
combination with the assembly techniques described herein to 
::^rate such a pool of recombinants that are high^ enriched 
for polymorphic ("mutant") information. In some embodiments, 
the pool of recombinant genes is provided in a transgenic 
host. Recombinants can be further evolved by PCR 
amplification of the transgene from transgenic -3-"""= 
are determined to have an improved phenotype and applying the 
formats described in this invention to further evolve them. 

D Tnt-ron-drj '^"" recombination 

in some instances, the substrate molecules for 
recombination have uniformly low homology, ^^^^^f^^";^ 
distributed regions of homology, or the region of homology is 
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relatively small (for example, about 10 - 100 bp), such as 
phage displayed peptide ligands. These factors can reduce the 
efficiency and randomness of recombination in RSR. In some 
eiT±)odiments of the invention, this problem is addressed by the 
5 introduction of introns between coding exons in sequences 
encoding protein homologues. In further embodiments of the 
invention, introns can be used (Chong et al., J. Biol , Chem. , 
271:22159-22168 (1996)). 

In this method, a nucleic acid sequence, such as a 

10 gene or gene family, is arbitrarily defined to have segments. 
The segments are preferably exons - Introns are engineered 
between the segments. Preferably, the intron inserted between 
the first and second segments is at least about 10% divergent 
from the intron inserted between second and third segments, 

15 the intron inserted between second and third segments is at 
least about 10% divergent from the introns inserted between 
any of the previous segment pairs, and so on through segments 
n and n+1. The introns between any given set of exons will 
thus initially be identical between all clones in the library, 

20 whereas the exons can be arbitrarily divergent in sequence. 
The introns therefore provide homologous DNA sequences that 
will permit application of any of the described methods for 
RSR while the exons can be arbitrarily small or divergent in 
sequence, and can evolve to achieve an arbitrarily large 

25 degree of sequence divergence without a significant loss in 
efficiency in recombination. Restriction sites can also be 
engineered into the intronic nucleic acid sequence of interest 
so as to allow a directed reassemmbly of restriction 
fragments. The starting exon DNA may be synthesized de novo 

30 from sequence information, or may be present in any nucleic 

acid preparation ( e.g., genomic, cDNA, libraries, and so on). 
For example, 1 to 10 nonhomologous introns can be designed to 
direct recombination of the nucleic acid sequences of interest 
by placing them between exons. The sequence of the introns 

35 can be all or partly obtained from known intron sequence. 
Preferably, the introns are self -splicing. Ordered sets of 
introns and exon libraries are assembled into functional genes 
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by standard methods (Sambrook et al., ^^olPr.lar Cloninq . CSH 

Press (1987) ) . 

My of the tormats for in vitro or in vivo 
reco^ination described herein can be applied for "-^-ive 
exon shuffling. A preferred format is to use nonpalindromro 
restriction sites such as Sf i I. placed into the intronic 
sequences to promote shuffling. Pools of selected clones are 
digested with Sfi I and religated. The nonpalindromic 
ovLhangs promote ordered reassembly of the shuffled exons. 
These libraries of genes can be expressed and screened for 

desired properties, then subjected to ^-'^^ /--"^^ ^ 
of recombination by this process. In some embodiments, the 
libraries are subjected to mutagenesis before the process is 
repeated. ^ ^^^^^^ introduction of an intron into 

a mammalian library format would be used advantageously is as 
follows. »n intron containing a lox <Sauer et al. , SrSS^ 

,^ ...^ m ..A.l. 65:5166-5170 (1988)1 Site is 

'arbitrarily Introduced between amino acids 92 and 9 in each 
alpha interferon, parental substrate. A library of 10 
chLric interferon genes Is made for each of the two exons 
'residues 1-92 and residues 93-167, , cloned Into a replicating 
raslld vector, and introduced Into target cells. The number 
10« is arbitrarily chosen tor convenience in screening. An 
i exemplary vector for expression in mammalian cells """l^ 
contain an SV40 origin, with the host cells expressing SV40 
large T antigen, so as to allow transient expression of the 
interferon. constructs. The cells are challenged with a 
cytopathic virus such as vesicular stomatitis virus (VSV, in 
0 an interferon protection assay <e.g., Meister et al 
Virol. 67:1633-1643, (1986)). Cells surviving due to 
— sion of interferon are recovered, the two libraries of 
iTerferon genes are PCR amplified, and recloned into a^vector 
hat can be amplified in coli. The amplified plasmids are 
, then transfected at high multiplicity (e.g. ^l^-^TTJ 
plasmid per lo' cells) into a ere expressing host that c-n 
support replication of that . vector. The presence of ■ ere ■ in 
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the host cells promotes eftioient recontination at the lox 
site in the interferon intrcn, thus shuffling the selected, 
.ets of exons. This population of cells is then used in a 
second round of selection by viral challenge and the process 
is applied recursively. In this format, the ere recombinase xs 
preferrably expressed transiently on a cotransf ected molecule 

, fV.« host Thus, after segregation. ot 

that cannot replicate in the nose, nu , . , 

recombinants from the ere expressing plasmid, no further 
recombination will occur and sel,ction can be °" 
genetically stable exon permutations. The method can be used 
with more than one intron, with recombination enhancing ^ 
sequences other than cre/lox (e.g., int/xis, etc.), and with 
other vector systems such as but not limited to retroviruses, 
adenovirus or adeno- associated virus. 

5. n,io, ---— -^'H. «.di,t>a B,ro„fc»natioi 

1 nn ^^ hr-ld Tt ■rross senumce space 
in some embodiments of the invention, a search of a 
region of sequence space defined by a set of substrates, such 
meters of a gene family, having less than about SO. more 
typically, less than about 50% homology, is desired. This 
region, which can be part or all of a gene or a gene is . 
arlitrlrily delineated into segments. The seg^nt borders can ■ 
be chosen randomly, based on correspondence with natural 
exons, based on structural considerations (loops, alpha 
helices, subdomains, whole domains, hydrophobic core, surface, 
dynamic simulations,, and based on correlations with genetic 

mapping ^»'^^^^^^^^ ^^^^^^ ,„pli£ied by PCH 

, with a pool of "bridge" oligonucleotides at each junction. 
Thus, if the set of five genes is broken into three segments 
A B and C. and if there are five versions of each segment 
M . C4, C5), twenty five oligonucleotides are made 
or' each strand of the A-B junctions where each bridge oligo 
5 has 20 bases of homology to one of the A and one of the B 
segments. In some cases, the number of required 
oligonucleotides can be reduced by choosing segment boundaries 
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of the gene family members. 
'A^ni■^elAl in some or all ot tne yc 
that are identical in „^u*.aized for the B-C 

bridge cUgonucleotxdas, and = ^^^^ l-9th 

oUgonucleotides plus ^t^.^, .He dUTP/uracil 

genes «e »ade then made p^eEeraWy, products 

,l,cosvUse ■"->'°^;;"::f;Ar:: agenesis before the process 
^0 from this step are subiected a desired 

o« selection and ^-^fXZZZ t. a' desired property is 
level of i"*-^-^"^"' determined using a screening or 

::re::::n arrpp^pre^- the protein ana property o. 

{or the recombination ot eievei 
interferon genes. 

^''ll'ff'';^^ invention, sequence 
in some embodiments o£ the _ a 

information from one or more substrat s^uences^ 
,S given "parental, se^ence of ^,,,,,i,„. 
reco^ination between rounds °' „„.agenesis 
Typically, this is done with eite^di ^^^^^^^^ 
performed by techni^es '^"^^ „„e substrate 

al., BcleciOa^XlaBina. CSH Pr 
30 as template and . e.g. homologous 

stations from ,,r an improved phenotype 

genes. After screening or £.,rther 
of interest, the selected -^^^^^^-^^^f^Zr.. After 
evolved using KSK technigues -—^^ ^/^^^^.^ „n be done 

35 screening or ='le='"°"' . „ „j oligonucleotides encoding 

again with another ^^^^^^ ^ ^^^,3,,, ^.il the 

homologue mutations, and the an 
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desired properties are obtained. 

When the difference between two homologues is one or 
more single point mutations in a codon, degenerate 
oligonucleotides can be used that encode the sequences in both 
5 homologues. One oligo may include many such degenerate codons 
and still allow one to exhaustively search all permutations 
over that block of sequence. An example of this is provided 
below for the evolution of alpha interferon genes. 

When the homologue sequence space is very large, it 

10 can be advantageous to restrict the search to certain 
variants. Thus, for example, computer modelling tools 
(Lathrop et al., J. Mol. Biol. , 255:641-665 (1996)) can be 
used to model each homologue mutation onto the target protein 
and discard any mutations that are predicted to grossly 

15 disrupt structure and function. 

F. Recombination Directed by Host Machinery 

In some embodiments of the invention, DNA substrate 
molecules are introduced into cells, wherein the cellular 

20 machinery directs their recombination. For example, a library 
of mutants is constructed and screened or selected for mutants 
with improved phenotypes by any of the techniques described 
herein. The DNA substrate molecules encoding the best 
candidates are recovered by any of the techniques described 

25 herein, then fragmented and used to transfect a mammalian host 
and screened or selected for improved function. The DNA 
substrate molecules are recovered from the mammalian host, 
such as by PGR, and the process is repeated until a desired 
level of improvement is obtained. In some embodiments, the 

30 fragments are denatured and reannealed prior to transfection, 
coated with recombination stimulating proteins such as recA, 
or co-transf ected with a selectable marker such as Neo^ to 
allow the positive selection for cells receiving recombined 
versions of the gene of interest . 

35 For example, this format is preferred for the in 

vivo affinity maturation of an antibody by RSR. In brief, a 
library of mutant antibodies is generated, as described herein 
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£or the 48G7 affinity maturation. This Ubrary is 
purified with liean. to enrich for antibodres With the ^ghe« 
0 1 . 10% affinity. The v regions genes are recovered by PCR. 
ragmentea, and cotransf acted or electorporated with a vector 
into which reasaenl^led V region genes can ■^-""^^V ""^ . . 
.ubstrate .clecules are recovered fron, the co-nfe« a cells, 
and the process is repeated until the desrred evel 
i^rovment is obtained. Other e,nbodiments rncluda 
reLse.*Ung the V regions prior, to the =1-"°^°""°^ J 
that an intact V region exon can ^^^^'^"^^'^H;; ^^'^l, 
expression cassette, -rthar e*oai.ants .nclude ^he - o£^^ 
this format for other eukaryotic genes or for the evo 
whole viruses. 

Q p>,a qemid- PrF'"^ &Bsemblv 

in some embodiments of the invention, a gene of 

,„.erest is Cloned into a vector ^^^^^^.^^^^^ 
stranded DNA. such as a phagemid. The resulting 
is mutagenzied by RSR in any method knovm xn the art, 
, transfected into host cells, and subjected to a screen or 
' relection - a desired property or -J^-^^^;^^^ 
,ro. the selected or screened P---- - Z X^J^^^ 
example. PCR or plasmid preparation. This DNA prep 
contains the various mutant sequences that one wishes to 

Z This DNA is fragmented and denatured, and annealed 
" "rsingle stranded DNA (ssONA) phagemid template (ssDNA 

wild-tvpe gene and vector sequences) . a 
referedtr^^^^^^^^^^ ^^e of --^^ 

such as CJ236 (Sambrook et al.. Moleculm^^lsniHa CSH Press 
nqa7)) for the preparation of ssDNA. 

(1987)) xor ^ , . ^„„^T-te are filled with DNA 

Gaps in annealed template are riixe 



Since 



-ri:^::ircrn ir tir prar^^rnW 

rtreiirstrand now consists of shuffled These 
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synx-ii^"-^^' o^^a^n of E coli whicn 

products are transformed into a muts strarn of E. 

Is dut. ung*. Phagemid DNA is '".^^f/^/ 
host and subjected again to this protocol untU the -dasrred 
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.;^:r:e™e, ..... ... ... 

process is repeated. 

Ill T^ r""^*^'^ Pr rt-i" Tg^ression ^ v,p a very 

. • ^ T=-rrrP nure . and homogeneous 
,e„e«l ™e.hoa , ! ;r::;uence. of interest, 

^antitxes of almost all nucl production o£ 

similar generality has not ye.; „corrtinant £6rm. A 

larse amounts of pure, homogeneous protern in recombr 
lik!ly explanation is that protein expression. _^ 
wLatln ana staMUty is --"-"^/^^/.tX a 
unpredictable than for B». ^■^t/^^/^/^::::!::! J„.es, 
co^lex function of "--"^""^^^f;;^' n„cent 
interactions with the ribosome, interaction 

Tolypeptide With chaperonins ^^^^^ -^l^s ;f 
..aciency Of ^^^^^^^ protease 
secretion and otner prot«-^ „x final folded 

. • =«ri t-he intrinsic stability of the tmai .i-^ 

sensitivity,; and the intrins . . suited for 

nr.i-"imization of such complex processes is weix 
state. Optimizacion oj. ^^.u^jc riptail strategies 

«-F r<;r The following methods detail a 
the application of RSR. me . . ^. protein expression, 

for application of RSR to the optimization of protein P 

^^^---^^^^^^^^^ coli codons on expression 

The negative ^"^^ ^Z;//;^^^ ,,,, demonstrated 
recon^inant proteins nt.s^ However, 

'"''TZel\^^''^^^^^ of codon usage patterns to optimize 
° Tx-r Actional protein hav J^^^ 

e^odiments Of the ^^0 ^^^^^^^^^^^^ .^sed on the 

changing codons used in the gene ° accomplished 

degeneracy Of ---^ J,^^^^^^^^^^ - 
35 by synthesizing the gene using ^ -^^ ^^^^ the general 

some embodiments the degenerate oligonucleot 3^^,3t^ate 
structure of about 20 nucleotides of identity to a DNA 
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the oUgonuclectid.s a« identical to th 

one 5. and one 3' nucleotide, bu have "^^f/^^^^^^^, „ 
ho™=logy to the B» suhst«t. - «u , - ^ 

the use of degenerate codons In some ^^^^ oligonucleotide 

degenerate cligonuclectides x= ^J^^* ^^^..^ „ is 

overlaps with another by the ^^^^^f tgonucle^ides are 
the length oj the oligonucleotide Such olig ^^^^^led 
typically about .0 - 1000 nucleotides n lengthy T.^^.^^^^^ 

,enes are then ^^^'^^ ZZlVl^^.^'^s^.. '^ subjected to 
rcriteTeroriratiorjLe as descibed above until the 
aesired i^prove^n^ - achieved^^^^ ^ ^^^^ " TnT 

X. v.=i. = =^ (BTAP) for active expression m £^ 
. i. ^♦-^«ai alkaline phosphatase iBiAf^ j-uj. 

intestinal aiKaiine t> t< ^ - reoorter gene in assay 

This enzyme is commonly used as a reporter g 
S^atsTcH afK^lS.. ..e cloned — t ^ -re.ed 

,otive .or™ in a ^ r;r:-:;iraUo« L to access 

"'"r::: : p: l « L B1.P and. .^ortantly .or 

inexpensive express ^^^witv or chemical coupling 

' -^Tr:::: zr: rir:::i:::rL . ^ -iiea 

rr^le - P-ide in the section. 

o TTTi p-roved Folding 
0 ■ - -^reHriresrrnt^^^^ ^s.. 

rrn:i:ri:nTorir -itb rra^onty o. the e^d protem 

XToLd in insoluble aggr gates^ .ch 

.eco^ination ^^^^^^^^^ ways to in^rove folding. 
,5 target proteins. There are l^J\ \ interest and 

including mutating evolving the target protein 
evolving chaperonin proteins. 
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a Inclusion.Bod:LJ:rac ^ Selecti on 

lac HeadEiece^iser.y^i'^^"^ Proteia 

..... ":.-t:irp"^^^^^^ 

linker which binas the „iii regain bound to 

polypeptide fusions to ^"'l^^^J an affinity purification 

.he plas»ia ,SS,3,3-3S. . This 

process {Gates et al.. --— ^^oive mutant proteins 

T,r..rr.:; -—tn: r.:.:r. 

target protein sequence xs -""""'"^^^^^ j^st one lac 

-es - 

operator, is mutagenized oy directed 

PC mutagenesis, chemical -^^^^^ ,„3S U.»v>.. 
^.agenesis .Sa^hroo. e -^^^—7^, ceU, and 
The resulting l^""-^," ,^ preferably with 

egression "--Jj;;:/ "y ,.„erated fro™ a culture of 

arabinose. *n extract or y ,„soluble protein is 

the library expressing the construct^ Ins „„„,£„,ation 
fractionated fro. soluble P-"-/^'*.^^';, Jie protein/D». 
or affinity chromatography /^^^^/^^ „,^„o. et al.. 

complexes is --^"^-^t^T. 7 oHhe plasmid. Preferably, a 

BslSSJil^^^lS"^' '^^^riy folded protein, such as a , 

reagent that is specific for P"P« J „ p,,ijy soluble 

monoclonal antibody or ^^^'^^fj:^''^,,^ ... step is 
protein/DN^ -""Pl"";^ J^^^^ expressed. These steps are 

» isolated, protein/DNA oon^lexe, has 

repeated until the yield or individual clones are 

_hed a desired --^^ CcX^^^^^^^^^ of the 

then screened for retentio activity, etc. 

protein of interest, -=^^=;" ^::^ly useful for evolving 

This technique is 9--"""^ trafficking of 

solubility and other P-P^"-/ '^^^ cell of interest. For 

proteins heterologously expressed in a ho 
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example one could select for efficient folding and nuclear 
: Ition Of a protein fused to the lac repressor ^^'^^ , 
aimer by encoding the protein on a plasmid encodrng an SV40 orrg^n 
o, replication and a lac operator, and transiently expresBxng the 
fusion protein in a mammalian host expressing ^ 
purification of protein/DNA complexes from nuclear "^■'^ 
,seed and Aruffo. Vynr ,r,^. ^j. , 

,1587)) would allow one to select for efficient foldrng and 
nuclear localization proteins. 

b. li«°r - r'"" """^ 

Display 

problem often encountered in phage display -"ethods 
such as those disclosed by CHeil et al. , p , rre„t Bi^l,gY , 5,443- 
: .15.5,. is the inability to functionally express a protexn of 

I In ohaae without being limited to any :one theory, 
interest - P^^^- ^^^^^^^^ ^ responsible for 

improper folding or cne y^^^ - int-P-rest for 

this problem. RSR can be used to evolve a protexn of interest f 
CctLal expression on phage^ ^^.-ly a 
constructed between gene III or gene vili 

• fr^T pxamcle bv PGR mutagenesis. ine 

and then mutagenized, tor example by ^ _ 

mutagenzled library is then expressed .n a ^^^^e di P Y 
Phage lysate is made, and these phage are affinity selected 
those bearing functionally displayed fusion proteins using an 

L: ::t!ix containing a ,.nown llgand for 
Dm from the functionally selected phage is purified and the 
Z ayed genes of Interest are shuffled and recloned into the 
phage display format. The selection, shuffling and recloning 
fteos are repeated until the yield of phage with functional 
„ I p L; d Trotem has reached desired levels as defined, for 
!xa^ll, by the fraction of phage that are retained on a^ligand 
a«inlt;. matrix or the biological activity "^^^^^^^^ ^^e^^,^ 
displayed phage. Individual clones are then "^f/'^^f^, 

candidate mutants with improved display P-P^""- / ^^^^ 
5 of expression, and functional properties of interest (..g. , 

ability to bind a llgand or receptor, lymphoUne activity, 

enzymatic activity, etc.). 
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in some einbcdiments of the invention, a functional 

screen or selection is used to identify an evolved "f 
screen or protein, which cannot initially 

expressed on a phage. The '"9^^ ^ ^t.genized and 

be efficiently expressed in a host of interest, 

a fun tional screen or selection is used to identify cells 
X :::sin, functional protein. Por exa^le ^-e P-ein o 

rrdt rarrrre-i^oved functional expression fro. 

such a --^^^-rr:::;se transcriptase is of particular 
co^ercialTmportaL hecause it is active - -^^he^^^^^^^^^ 
(420 c) and is more robust than many other reverse era 
Lei, it is difficult to express in P--^-^ ^^^^ Z 
„li, and is consequently ^^^^^ "Zell 
rMirified from chicken cells. Thus an evoxve 
trans^rtptase that can he expressed efficiently in coll is 

.i,hly deairahle^^^ ^ ^.^e transcriptase -ne .Papas et 
J Biochem 20:95-103 (1982)) is mutageni.ed by any 

tj^^-^^nrZ:^^. The library mutant genes IS Cloned 

fnto a colBl plasmid <.mp resistant, ---f^L^ISL^ 
propter in a pol.12 .Ts, c„l»^.Swea y t al.^^.^_ 

r^^A TT Q TV 90:4626-4630 (lyy^M Et^ — i^Mii 

^^^^h IPTG and shifted to the nonpermissive temperature. 

rsti:::: r^rctionauy expressed r^^^-^:^:, 

genes under the selective conditions reported or selection 
active HIV reverse transcriptase mutants reported by Kim et 

, sII^^ITIhV RTX genes are recovered by PCR by — ^ 
riigonucleotides flanging the -e^^ ^^^^^^^^^^^^^^ 

products are sub^^te to ^^^^^^^^ 

above, and P-«== ^ /^..^...^i ,i,„,s are then screened for 
expression is acceptable. properties of interest 

5 R».-dependent D» polymerization and other propert 

,e.g. half life at room temperature, ^^^^^^^ ^ again to 
clones are subjected to mutagenesis, and then 
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yield an AMV RT that can be expressed in E. coli at high levels. 
2 Evolved rVtaperonins 

in some, embodiments of the invention, overexpression of 
a protein can lead to the accumulation of folding intermediates 
which have a tendency to aggregate. Without being limited to any 
one theory, the role of chaperonins is thought to be to stabilize 
such folding intermediates against aggregrat ion; thus, 
overexpression of a protein of interest can lead to overwhelming 
the capacity of chaperonins. Chaperonin genes can be- evolved 
using the techniques of the invention, either alone or in 
combination with the genes encoding the protein of interest, to 

overcome this problem. . n 

Examples of proteins of interest which are • especially 
suited to this approach include but are not limited to: ^ 
cytokines; malarial coat proteins; T cell receptors; antibodies; 
industrial enzymes (e.g.. detergent proteases and detergent 
lipases)-; viral proteins for use in vaccines; and plant seed 

storage proteins. -,. j 

sources of chaperonin genes include but are limited to 
E coli chaperonin genes encoding such proteins as- thioredoxin, 
Gro ES/Gro EL, PapD, ClpB, DsbA, DsbB, DnaJ, DnaK. and GrpE; 
mammalian chaperonins such as Hsp70. Hsp72, Hsp73, Hsp40,Hsp60, 
HsplO, Hdjl, TCP-1, Cpneo, BiP; and the homologues of these 
chaperonin genes in other species such as yeast (J.G. Wall and A. 
Pluckthun, n„...n^ Biology. 6:507-516 (1995); Hartl, 
381-571-580 (1996)). Additionally, heterologous genomic or cDNA 
libraries can be used as libraries to select or screen for novel 
chaperonins. 

in general, evolution of chaperonins is accomplished by 
first mutagenizing chaperonin genes, screening or selecting for 
improved expression of the target protein of interest, subjecting 
the mutated chaperonin genes to RSR, and repeating selection- or 
screening. As with all RSR techniques, this is repeated until the 
5 desired improvement of expression of the protein of interest is 
obtained. Two exemplary approaches are provide below. 
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a. r>.« ppronin E^ rr.^ ..'^ \ an in Tranfl to the Protein 
r^f Tnteresfr with a P rr«.en or Pelection for 
Tm proved Function 
in some embodiments the chaperohin genes are evolved 
independently of the gene(s) for the protein of interest. The 
improvement in the evolved chaperonin can be assayed, for example, 
by screening for enhancement of the activity of the target protexn 
itself or for the activity of a fusion protein comprising the 
target protein and a selectable or screenable protein (e.g.. GFP. 
alkaline phosphatase or beta-galactosidase) . . 

b, rhaneronin Qpero n in cis 
in some embodiments, the chaperonin genes and the target 
protein genes are encoded on the same plasmid. but not necessarily 
evolved together. For example, a lac headpiece dimer can be fused 
to the protein target to allow for selection of plasmids whxch 
encode soluble protein. Chaperonin genes, are provided on^thxs 
same plasmid ("cis") and are shuffled and evolved rather than the 
target protein. Similarly, the chaperonin genes can be cloned 
onto a phagemid plasmid that encodes a gene III or gene VIII 
fusion with a protein of interest. The cloned chaperonins are 
mutagenized and. as with the selection described above, phage 
expressing functionally displayed fusion protein are isolated on 
an affinity matrix. The chaperonin genes from these phage are 
shuffled and the cycle of selection, mutation and recombination 
are applied recursively until fusion proteins are efficiently 
displayed in functional form. 

3 Tn^ p^oved ir «-^».r^*^lliilar Localization 
Many overexpressed proteins of biotechnological interest 
are secreted into the periplasm or media to give advantages in 
purification or activity assays. Optimization for high level 
secretion is difficult because the process is controlled by many 
genes and hence optimization may require multiple mutations 
affecting the expression level and structure of several of these 
componenL. Protein secretion in coli, for example, is .n^ 
to be influenced by many proteins including: a secretory ATPase 
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(SecA) , a translocase complex (SecB, SecD, SecE, SecF, and SecY) , 
chaperonins (DnaK, DnaJ, GroES, GroEL) , signal peptidases (LepB, 
LspA, Ppp) , specific folding catalysts (DsbA) and other proteins 
of less well defined function (e.g., Ffh, FtsY) • (Sandkvist et al., 
5 Curr. Op. Biotechnol. 7:505-511 (1996)). . Overproduction of wild 
type or mutant copies of these genes for these proteins can 
significantly increase the yield of mature secreted protein. For 
example; overexpression of secY or secY4 significantly increased 
the periplasmic yield of mature human IL6 from a hIL6*pre-0mpA 
10 fusion (Perez-Perez et al., Bio-Technolooy 12:178-180 (1994)), 
Analogously, overexpression of DnaK/DnaJ in E. coli improved the 
yield of secreted human granulocyte colony stimulating factor 
(Perez-Perez et al., Biochem. Biophvs. Res. Commun. 210:254-259 
(1995)) . 

15 RSR provides a route to evolution of one or more of the 

above named components of the secretory pathway. The following 
strategy is employed to optimize protein secretion in E. coli. 
Variations on this method, suitable for application to Bacillus 
subtills, Pseudomonas, Saccaromyces cerevisise, Pichia pastoris, 

20 mammalian cells and other hosts are also described. The general 
protocol is as follows.. 

One or more of the genes named above are obtained by PGR 
amplification from £?. coJi genomic DNA using known flanking 
sequence, and cloned in an ordered array into a plasmid or cosmid 

25 vector. These genes do not in general occur naturally in 

clusters, and hence these will comprise artificial gene clusters. 
The genes may be cloned under the control of their natural 
promoter or under the control of another promoter such as the lac, 
tac, arabinose, or trp promoters. Typically, rare restriction 

30 sites such as Sfi I are placed between the genes to facilitate 

ordered reassembly of shuffled genes as described in the methods . 
of the invention. 

The gene cluster is mutagenized and introduced into a 
host cell in which the gene of interest can be inducibly 

35 expressed. Expression of the target gene to be secreted and of 
the cloned genes is induced by standard methods for the promoter 
of interest (e.g., addition, of I mM IPTG for the lac promoter) . 



50 



10 



. •^r, Kv a library of mutants is 

measured, for example by „ionie« 

.... ...^^ind^ ia ^ 3-e"d protein .the top O.X - 

expressing the "^f ^,^,4. Plasmid DSA is prepared 

To! fhtroiu:: thLnea^— to. an. o. t. .thods 

°^ n:rra.iV, each individ^ai ^ ^ ---t:/,^^^^ ^th 

^ ^ pqR « The fragments are v^j-^ 
population and subjected "^^"^ „onpalindromic overhangs 

X L liy ™^ Ugase, and Ugated 

designed to promote ordered r jobation of 

together, preferably at I- J/^icroliter) . Bach of the 

-''''r.:::^:^^^^ prior to r.asse^ly 

PGR amplified gene pop ^ ugation products are 

rnrrrblorrntX^ost^:. inUst and the cycle of 

— tz::^,^^ — r-r crir 

- — r:rg:r u:;er:U are targets for 
0 homologs of the E. coxi g homologs have been 

optimization, and ^^^.^.^.t^ 57 :SO-XOS 

identified ^" jro'tL n J;gs, other co^onents such as 

(1993)1. in addition to tne . __„i.tion particle, the 
.he Six polypeptides of the "^^^ ^^.^^L (T»«) , BiP, the 
, trans-locating 7;;^:" - 

ssa proteins and other hsp/u ^3, (1553,, are targets 

,3i.onen and Pulva ,,,,„„al vectors 

£or optimization by RSR. l S „oleculaE-a2niB3. CSH Press 
such as SV,0-neo -Bambrook ^t ' ■ ^^^^^^^^^ll^^^as ,,5„)> 
30 (1.8„ , ■'-"•-^^ ; ;;n;;n;^Plasmids for yeast 

£or ma-alian "^^^ "^^^^^ialaffijI^StJCSma^ 
(Strathern et al., integrative vectors 

^^3^^^, CSH press ,or B. subtiJis 

such as p» 103, p™ " ff„^ "sjil.^.yb,ill^->ii^ 

3s (Peregc, ^hap- 42, PP^ '^=',,„ensh.in, Hoch, and R. ^osiOc, 



eds.. 1993) 



, »„ efficiently secreted thermostable DNR 
livea thus allowing the pe.£or».ce of DH. 
polymerase can be evolved th ^ ideation of the 

polyvnerization assays ^.^^ be preferred for 

expressed D» P° V^e^ase^^^* J 
, the expression of 1^^"'"= ,3,,^, for exan^le any of the 

„ished to test rn a ^^'^ '^^"/^^ ^^^^ or any industrial 
pharmaceutical P"" ^J/^/f^e »ade by fusing a signal peptide 
enzyme. Initial constructs ar terminus of the 

snch as that fro. STXX or ^ - .0 

0 protein to be secreted. A gene ^tagenized and 

act in the secretory pathway °^;;^^;;=\„,,,iaual clones are 
coexpressed with the target c ^^^^^ ^^^^^^^^ ^^^^ 

screened for expresion o the g ^ ^ and 
clusters from improved clones are rec ^^^^ 
,S introduced bac. into the ""^^^^ is repeated . 

r c;:rir:erea:en:rrtr^^^^^^^^ irovement m expression 
of secreted protein is achieved. 

A. !a;2l^'S'*-I^^''*^*'^='-^*^T:dustrial interest that have 

:::rvrty1ntrsrstrate of mterest. 
substantially -^^""/V^^f ^^.^^ obtained fro™ nature is 
in many of these cases. ^^^^ ,,,, .ery different 

25 required to work either und ^ activity 

f.om the conditions "f^- f the natural substrate, 
towards a substrate that ^= technologies to 

=PP"=--;;;;;;,';rc:nt/y Umlted by the types of 
industrial enzymes "j'" modest numbers of mutants 

30 «l-tions that can be applie enzymes or 

that can be surveyed in sere airplay format, for binding 

catalytic antibodies, ^^"^^^^^^^J^^'t l.. . Apel^Ji^ 
to transition , =„r au^-ate analogs Uanda et ai., 

g^^^j^^j^ ,7:157-m 5^.2532.2536, (1994) ) represents a 

catalytic efficiency. 



10 



, ,1 Currsnt_li2l2fflC 5:443-449 
Phage display (O'^eU '\'' -^^^:Z:Zm,.^^ 
,,„S, ana the o.he. ^^^f "^^jLi.^^.^^ 

jjj^^ "^'<'"-'°", " ' ' finlty-based selections to proteins 
^thcdologies for applying affin V ^^^^^^^3. 

ot interest. For example, \ 3^ substrate to 

a.93„ have used phage di P ay of ,a^p^^^ ^^^^^^ 

.elect improved =<*=";';=;^^^ 3Uo„s selection of mutant 

surface o£ phage, on f'J'^ ^^ ,„,iog binding, 

proteins with transition state analogs correlate 

improvements in ^"^''^'^^^^^'J^^^l.^^.y . .or exan^le. Patten et 
with improvements in ""^^"^ ^^^^ that improvements an- 

al., for its hapten are well 

affinity of a catalytic -"J°°*/;J , efficiency, with an 80- 
correlated with ^"^7;;;;^=^:„f,: Jeved for an esterolytic 
fold improvement in kcat/Km Being 

antibody. . i„ antibiotic biosynthesis 

For, example, an enzyme ^" under 

can be evolved for new ^^icns. Some 

, aesired conditions using phage d splay^s^^ ^,,,„tions of 

antibiotics are -"-'^V^f/ J^^^,,^. complete biosynthesrs 
biologically produced starting P „„i,al because of the 

o, .he desired '-'^^."^^l^; j:^^, L.ymatic activity and 
lack of an enzyme with the req 10,324.329, September 

, substrate --"-^-^^^rd tc^ephalosporanic acid 
,„2) . For exa»^le , ^ . ,,,i,ally produced 

^ ^-^""rj: is Ide by a Chemical ring expansion of 
cephalosporins. 7-M)Cfl deaoylation of the 

penicillin . followed ^^^^J^^ enzymatically f ro» 
30 phenoxyacetal group. 7-»CA ^^^^^ ^ ,,,„ed 

deacetylcephalosporin C ^AOC K ^ ^.^^^^ 

from penicillin V by enzymatic ring J (cantwell et al., 

Pitied penicillin ,.m>« could in principle 

^^^^ 17,.13-221 T^f^;.„ , a modified 

3s be produced enzymatically 'l^l ^^^^l^, of the S. 

-xt: tr::::' .rr:d,iu. ^o..-3., septe^er 
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Kno«n expanaase With .u«.c.e .^.^^^^^^^^^^^^ 
useful. AS outlined below, RSR techn q _ ^^^^^ 
used to evolve the penicilUn ^ ,„bstrate. 

.„ i-hiit thev will use penicillin V as a su 

= — %::rai;ia; o. othe. aispi. rrrprctilin 
..Plied to --;,::;r.:rt;r^^^^^^^^^^ — - 

expandase mutants in a display applying RSR to 

^ «^ t-i-ansition state analogs, ana apF^y » 
substrates or ^,„,,,3. candidates are further 

"pidly evolve^^ am^^^^^^ ^^^^ on 

screened to identity conditions, such as pH, 

penicillin V under «a"-* applied to 
teu^erature, solvent „p,„aase activity. A 

further evolve ,nutants with '"^^^"^ ^^itable for this 

,S nun^er of transition state a^al^s TS. > ^^^^ 
The following structure 
r srectiln Of the display library of cefE .utant.: 



20 





COjH 



25 




'bOaH 



O 



30 



* COaH 



35 



.ibraries of the 

™ "'"^r::"::!: "en:-: di=piay ubrary is 

r:o'":ele ion or binding to penicillin V and/or to 
::: :::: n^st ranlL, .iven a-oove for the conversion of 



. V tc DAOC V. These binding selections may be performed 

penicillin V to DAOC ,^„aitions, such as elevated 

under non-physiological „„aer the ne» 

te„.erature, to obtain mutants ^^^ 'J-J^^^^ ^^^^ ^ . 

conditions. -^.^P^'t! " for the selecting llgand. «hen 

i„^rove.ent in "^-^^^^f ^as been obtained, candidate 

.he desired level ° /^/^''^t^.^^^nput format and specific 
mutants are expressed in ^ quantitatively 
activity for expanding P--;^^;;,^,,, activity are 
measured. /-"^ /J \ ^ Tea to .rrther evolve the.. 
—^^1:1::: :r;:l hmding by a -sprayed en.yme .e.. 
Phage display, lac headpiece dimer PO v;"- 

good selection for retention of the o ^^^^^ 
Live site and hence can ^ ^^^'^^-'^^^'^^'r ^^oh conditions 
„tain activity under conditions °* -'"-^ ,,„,aer pH 

.„olude but are not Jf/ron, a^MSO CSeto et al., 

optima, ts,. or formamide (Chen et al., 

^^^'^^ miVi S0:S.18-5622, altered 
H^»J-^='"*-^'''^-^iilf life, altered or broadened substrata 
temperature, improved ^^^^ _ ^ .^^.^ar example, the 
specificity, or protease 3 mammalian display 

evolution of a p-nitrophenyl esterase, u 
format, is provided below. 

- - ---'-r'-S^^^'t^^^Tre improved 

°^^r :^rc rrcid s^c^encmg and polymerase 

polymerases for use m attractive 

.-^r^c The following properties are attra 
chain reactions The fol ^.^^encing polymerase: (D 

candidates for improvement of a DN qu _^ ^^^^^ 

30 suppression Of -min^^^^^^ 

(H. Dierlck et al., JSslElSiS TTTTiv with f luorescently 

„re normalized pea. heights, x^aXS 
labelled dideoxy terminators ^l^^i;^'^ ..-ZH^Z^ i>^0. 
1.1 ,1,85)1, (3) batter o* ^^f ^^^^ .1., 

35 GC. by. for example, "l""^ . 

ri:roradi;;a;;;;^ei base 
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n deaza dGTP (Dierick et al.. ISusieifi 

fir^iHg Res. 2l>^^^ ^ 

improve the above properties ^ aescribed which use 

Novel sequencing formats have ^^^^ . ^^t- 
• . A laser desorption ionization time of fugni: 
„>atrix assisted ^--^^ J ^ leaders- (Smith, Hature 

TOF) mass spectroscopy to resolve Y ^^^^^^ 

iMn^ 14:1034-1085 1 9 )K t • 

.eview that ^-^--^^ f ^^f^ ^d as a viable alternative to 
limiting the development of this met ^^^^^^^ 

--^-^ riirtreTgi:^^^^^^ Zdr ii-ations Of the 
which stabilize the w gj-y^ , ^ RaDid_C2!mn*_iIi 

,..i.e bases ^oj--;-- ^dll^l-^^^^iT^^ 
-^/ei "e^greiri - .ass range imitation™ of .his 

^! ^-h 1996) Thus, evolved polymerases that can_ 
technique (Smith, ^'f ^'^^^^^ other base analogs conferring 

efficiently incorporate these ana conditions are 

resistance to fragmentation under MALDI TOF CO 

valuable innovations. interest for improvement 

Other polymerase properties of ^"^"^ ^ ^^^^ 
low fidelity thermostable DNA polymerase for _ 
by RSR are low fidelity correlate for acceptance of 

efficient mutagenesis or as a useful ^. 
base analogs for the purposes ^^^^^^^^ ) , higher 

polymerase for PCR (Lundberg ^ ' .^ne therapy 
fidelity reverse transcriptase or -J %^„3,,,,, of 

, vehicles to reduce --^^'J^^^ pcH with modified- 

the retrovirus; ^^^l^l^^^'l^^^Z , ^i^T^mi^ -^^^^ "''''^ 
bases (S. Turner and F. J. Jenkins. 

(1995^^- , invention, libraries of 

..... in see ;;X:t_, ...ou.hpu. 

;0 mutant polymerase genes are sere candidates 
screening «cr i^roved^— ^^^^^^^^^ 
„e then subjected to ^"Jf constructed using standard 

polymerases such as Taq polymerase are 

Ltrods such as PCR mutagenesis <"^^/7;^ ;3l^,;;^inrT^ - 

3s .,.8-33 (IS.... -a/or r ro^ration of mutations 

^,,^_S1^' <^=« r/th active site residue from T7 

into Taq DNA polymerase such as the active 
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..stance of dideoxy nucleotides (Tabor 
poly™era.e that i.pr°v.s „„„„ and stations 

and Richardson, i-E^^^T^^^^.i^.^e activity (R.S- Kano. 
,.at ^"-'";"^;7,;.3,, ,,„S,) are incorporated into these. 

^'^^"'^'^ne rersBCn^ly PCR technics, for «a,^le. as , 
libraries. The reasseinoix problem. 

aescribed above is -P^^^^/^/ri fa /e - ^ "^"^""^ 

Similarly, -'■^^"^ .^"'^""^.ef sequanase, and E. ccU poll 
existing thermophiUc f 7^f;: ;,,,l,ieotide methods described 
with each other uBxng the brxdge 9 ^^^^^ 
above. The libraries are ==^-=^^^ ^..^ individual colonies 

robotic colony picking is useo f polymerase 

into S6 well plates where small cultures 

expression is induced. purification for 

;;:;::il expression and purification ^„ ^.rform 

A high throughput sequencing assay is 

' .eguencing reactions with the 

analy.ed to --^rl^Tgher Juty ladders on 

according to any of th.se c ^_ including such 

.C-rich «7--:;j;:::::rte™ nation products and stronger 
^°^":;:h r: the wHd-type .n^yme,. less termination of 
signals than given i-belled reactions, e.g., 

reactions by inosine - i„ incorporation of 

fluorescent labelled primers; 1^==^" nucleotides at any 

Signals in reactions ^^^^ ''''''^.^^^J^^TZn obtained with the 
30 given position; longer ^^'^^^^^I'l'^l nucleotides; improved 
„iia-type -zv^e. ^ ^ ,,,, as v-deaza purines; 

acceptance of other knovm d combinatorial 

=^1= fserfrex:^^^^^^^^ - 

• The best candidates are then subjected to mutaSenesis, 
and then s^ecLd or screened for the i».roved seguencing 
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properties decribed above , ^.^een or selection is 

- ano-er ^^^^^l, „ can be placed 

performed as follows. The repxxc ^^^g^^ in E. coli or 

another mcroorgan.sK,. „,a™aUan 
demonstrated for making plaemid ^ ^^^^^jsi^JjLS^ 
polymerase beta (Sweasy et al.^K-^— —-^^ 
,0:46.6-4630, <19S3) > ■ 'f,!!"!!!, n or HIV reverse 

transcriptase (Kim et al., ^ '--TZZls placed on a plasmid 

ese (199S,). The mutant P°lvmerase gene rs P ^^^^^ 
= r-nlEl origin and expressed under cne 

arabinose in the screen utili^s the presence o£ 

^ further quantitative sere ^ ^^^^^^ 

- ..reen "-"-""rr nrerlissive tLperature in the 
plating onto arabinose at the no p _ fi^orimeter to. 

> absence of a selective ^''^^-^-■J"^ TaJoulture. CPP activity 

rererrrw "s-t^iuty a„d^^ — - ^» - 

:s superior sequencing enzyme, " 'J^^ „ ,,„ently 

.ignal for --P;;-™;/, ^fd Llles Lause it will have reduced 
used fluorescently labellea currently used 

specificity and selectivity. The error ^^^^^^ 
polymerases are on the order- „j ^he 

30 lower than -hat ^^^^J^^^'^^^ and possibly as high as 10%, 
gel systems. « ..^ .^sterns, and thus there . is a 

could not be "/".J^ease the .sloppiness- of the 

,arge ^ ^re : lung ^^^^^^^ 

enzyme. An errox 
35 such as for hypermutagenesis SJ^^^ ^ described by Suzuki 
in some embodiments, f = =5^""" ,6 = 9670-9675 

(Suzuki et al.., E.2^J.tL^=^i-^=^^^^i^ 



58 



10 



15 



20 



,s .sea to ™a.e repUcaCion ^^^^^^j'"^' 

replication of the tirst 200 polymerase (Sweasy and 

directly under the control o£ the ^^^^^^^ ^ ^ jia^JalU 

, h J Bact 177:2923-2925 (1995); S»easy ec al., K 

^ ^^S^ 90.4626-4630 U993)). A Bcreenable or 
A --"' - . . eodons is positioned in 

selectable reporter gene containing stop oodons P 

this region, such as LacZ alpha containing one, two 

rons.^ The constructs are anfplated on 

nonper^issible temperature, ^^'^^Jl^^Z r^.^Jor. ot th. stop 

'::z::::rL:^^ p"es recovered 

codons in the reporter enriched for 

V.,, DPP The survivors are ciij.*'- 
from the survivors by PCR. Tne s ,„^^„„. the rate of 

. . h.^ause their mutator phenotype increases tne 
mutators because tne „Dorter laeZ alpha fragment, 

•reversion of stop codons in ,re subjected to 

.he polymerase f f^^f^jr^formed into the 

KSR, then the polymerase """"^ J=^^"^^i/3,,,,„ed by plating on 
indicator strain. Mutators can be "^"^^^^ ^^^^^^ . „,tator 
erablnose/xgal Plates ^J^^-^:-: Lgh frequency of 
polymerases will give codon(s) . Candidate 

.,„e papillae due ^^^^^^ \ „on-papiUatins region 
papiUators can be rescreened by p colonies) and 

of the «=st heavily ^^^'''^^^\'°"'^^^J:Zi.^^o further screen 

i-u^ ra-r::^binose/Xqal indicator meaium 
replating on ^>^^;f ^^J/^j 'p,,iii3tion rates. . these steps are 

■ . ::e:r:::irrd::rrrd revLLon rate is achieved ..g.. ..-=.o 

candidates for encoding an error P^^/;^^^;;;/; J^ies ■ 
0 candidates are screened for "''--^-^fJ/^neLrlbed above. 

--T:z:z:^s:r:^:::^ - punfied m a 96. 

Briefly, mutant Taq p ^^^^ sequencing 

well format. The purified P to identify mutants 

reactions and the sequence data „„„„ts „ith 

vv^^h the improvements outlined herein, 
"d piU-lls^re subjected to KSH and rescreened for 
further improvements in function. 
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K^H^mpnts GFP containing etop codons instead 
In some embodiments, ^jtf = ^ 

^«r,= is used for the construction, 
of lacZ alpha with stop codons is usea ro ^ 
cells with reverted stop codons in GFP are selected by 
cells wiu sorter- (FACS) . In general, FACS 

«„cre.ce„ce .0 .va«a «U « ^^^^^^^^^ 

selection is performed by g 9 ^ acoordins to a 

preferably the "P'^ ^^^^^^^ et al., .Ca^ 2,.) :3.S.40X 
protoool similar to that of Da g ^^^^^ ^^^^ 

11982)) In other embodiments, the polA gene 
(19821). „f . site specific recombinase. The 

sites or other targets J i„,„,i,iy delete the 

.ecombinas. is ^"''"^■'^"l^^^^^ 23=485-490 (1«5„ . 
polygene (Hulbery et al., ^^^^t^;;^:;;;;7eelections at any 
.his »ould ^^^--J°Z'^Z.^^X. one could set up such a 
temperature ^ /^^^^/^^^^ mesophile or thermophile by placing 
selection m a deletable format and thus apply 

,,e pol. - - -r;fje under more general conditions^ 

,.e ^/,,rnts. this general system is preferred 

direct:: in vivo mutagenesis of gen.^ - -g-^^^^^ ^s^^ 
Cloned into the '^^^^^^ Ttrol of the error prone 

n;:::sr - ::s:rt^^^^^^^^ — » ^ poi..ts) re^ 
rtr:rd-g.o. at the -^^r::^-:^:^^ .^ 

specifically mutagenizing the target g 

rest of the plasmid with high ^-^^^^^l' ^ ^„ ,biUty 

in other embodiments, selection is based o 

mutant pol^erases to .CR amplify ^^^^f ;:;^..ases 
conditions or by utilizing ^^^^^^^^ a PCR amplification, thus 
act on the template that encodes them in a PCR P 

-""-'rbrrerrrn!triirrrcr::tants is repuca 

" Plated. preparations j^j: JTp^asmid 

crude plasmid preparations are -de of the same ^^^^ 
p„p is .CK-a^lified using ' ^ P°^vme"se prep^ 
plasmid under the conditions ^"-^ """^ . temperature of. 

pol^erase .e.g., f-^^ ^Jj: ' sa t::%c. with bas 
rarra- :throrr;./f:: use .th mass spectroscopy 
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sequencing, PGR of GC rich DNA (>60% GC),PCR with novel base 
analogs such as 7-deaza purines, 2' fluoro dNTP's^ rNTP's, PGR 
with inosine, etc.). The amplified genes are pooled, cloned, and 
subjected to mutagenesis, and the process repeated until an 
5 improvement is achieved. 

C. Evolved Phosphonatase 

Alkaline phosphatase is a widely used reporter enzyme 
for ELISA assays, protein fusion oassays, and in a secreted form as 

10 a reporter gene for mammalian cells. The chemical lability of p- 
nitrophenyl phosphate (pNPP) substrates and the existence of 
cellular phosphatases that cross-react with pNPP is an important 
limitation on the sensitivity of assays using this reporter gene. 
A reporter gene with superior signal to noise properties can be 

15 developed based on hydrolysis of p-nitrophenyl phosphonates, which 
are far more stable to base catalyzed hydrolysis than the 
corresponding phosphates. Additionally, there are far fewer 
naturally occurring cellular phosphonatases than alkaline 
phosphatases. Thus a p-nitrophenyl phosphonatase is an attractive 

20 replacement for alkaline phosphatase because the background due to 
chemical and enzymatic hydrolysis is much lower. This will allow 
one to make ELISA' s more sensitive for detecting very small 
concentrations of antigen. 

Chen et al. ( J. Mol . Biol. 234:165-178 (1993)) have 

25 shown that a Staph, aureus beta- lactamase can hydrolyze p- 

nitrophenyl phosphonate esters with single turnover kinetics. The 
active site Ser70 (the active site nucleophile for beta lactam 
hydrolysis) forms a covalent intermediate with the substrate. 
This is analogous to the first step in hydrolysis of beta lactams, 

30 and this enzyme can be evolved by RSR to hydrolyze phosphonates by 
a mechanism analogous to beta lactam hydrolysis. Metcalf and 
Wanner have described a cryptic phosphonate utilizing operon (phn) 
in E. coli, and have constructed strains bearing deletions of the 
phn operon (J. Bact;. 175:3430-3442 (1993)). This paper discloses 

35 selections for growth of E. coli on phosphate free minimal media 
where the phosphorous is derived from hydrolysis of ■ alkyl 
phosphonates by genes in the phn operon. Thus, one could select 
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for evolved p-nitrcphe„yl phosphonatases that are """^ 

e„icient phosphonatase is evolved as follows. A Ubrary o£ 

. of-nh aureus beta lactamase or of one oE tne t. 
mutants of the Staph. transformed into 

5 coll phn enzymes is constructed. The liorary _ 

I Ju mutants wherein the phn operon has been deleted, and 
.elected for .rowth on phosphate free MOPS ^/^^ 

:::::::: rrrrvrrh:ry:;fo-:ord hydroLis of p. 

10 nitrophenyl phosphonates . 

D ^rr.^.r^A Pet — Proteases „ ^„ 

linases are added in large quantities to 

Proteases and lipases aic • ' . ^„ 

aete..ents to en.y»atically degrade P""^" -^^/f^t/^rs 

" ^^"Tican:; ::rrhrne:: rL^ri: rtUents, with 

significantly reduceo t tormulation of detergents 

a conse^ent -/""^^ -/^t:! "rtlties. Proteases with 

'■"P~^^,f;;,::tty improved range of protein substrate 
improved specitic activity, f ^^=w-iT-ii-v at elevated 

a„ specificity, in^roved shelf life, ^'^^''^^ add 
temperature, and reduced requirements for surfactan 

.3lue to ^ .,„,.ed as foll^s^ 

..e Cloned ^^^^^^^^'^^^^^ ^z:::^^^ 

oc, 98-55-68 (1993)) can be subiected to RSR using g 

" : com le. protein media by virtue of secreted subti isin 

degrading the complex protein mixture. More =P-"'^""^;^^^. 

uLaries of subtilisin mutants --^^^::l:^::JZi:7Z.. 

vector which directs the mutant P-«-." ^^Z? ""les are grown 
30 subtilus. Bacillus hosts transformed ^^^^f t:,,^ /.a/or 

- — ''Z:^^^::::^^^^ .-rs 

rXreHo r then screened for improvement in a desired 
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performed as follows. A library of mutant protease genes is 
constructed on a display phage and the phage grown in a multiwell 
format or on plates. The phage are overlayed with a "protein.net" 
which ensnares the phage. The net can consist of a protein or 
proteins engineered with surface disulphides and then crosslinked 
with a library of peptide linkers. A further embodiment employs 
an auxiliary matrix to further trap the phage. The phage are 
further incubated, then washed to collect liberated phage wherein 
the displayed protease was able ^o liberate the phage from the 
protein net. The protease genes are then subjected to RSR for 
further evolution. A further embodiment employs a library of 
proteases encoded by but not displayed on a phagemid wherein 
streptavidin is fused to pIII by a peptide linker. The library of 
protease mutants is evolved to cleave the linker by selecting 
15 phagemids on a biotin column between rounds of amplification. 

In a further embodiment, the protease is not necessarily 
provided in a display format. The host cells secrete the protease 
encoded by but not surface diplayed by a phagemid,. while 
constrained to a well, for example, in a microtiter plate. Phage 
display format is preferred where an entire high titre lysate is 
encased in a protein net matrix, and the phage expressing active 
and broad specificity proteases digesting the matrix to be 
liberated for the next round of amplification, mutagenesis, and 
selection. 

25 In a further embodiment, the phage are not constrained to a 

well but, rather, protein binding filters are used to make a 
colony of plaque lifts and are screened for activity with 
chromogenic or fluorogenic substrates. Colonies or plaques 
corresponding to positive spots on the filters are picked and the 

30 encoded protease genes are recovered by, for example, PGR. The 
protease genes are then subjected to RSR for further evolution. 

F, Screens for Improved Protease Activity 

Peptide substrates containing fluoropores attached to 
35 the carboxy terminus and fluorescence quenching moities on the 

amino terminus, such as those described by Holskin, et al, ( Anal. 
Biochem. 227:148-55 (1995)) (e.g., (4-4/- 
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din,ethylaminophenazo)ben2oyl-arg-gly-val-val-asn-ala-ser-ser-arg- 
leu-ala-5- (2 • -aminoethyl) -amino] -naphthalene-l-sulfonic ac.d) are 
used to screen protease mutants for broadened or altered 
specificity, in brief, a library of peptide substrates is 
designed with a flourophore on the amino terminus and a potent 
fluorescence quencher on the carboxy terminus, or vice versa^ ^ 
supernatants containing secreted proteases are xncubated exther 
separately with various members of the library or with a complex^ 
cocktail. Those proteases which are highly active and have broad 
specificity will cleave the majority of the peptides, thus 
releasing the fluorophore from the quencher and giving a po3itive 
signal on a fluorimeter. This technique is amenable to a high 
density multiwell format. 

G T^ pr-^^-ina ph p ^^^'-'ii-^^al protPiTiB using RSR 

Table I lists proteins that are of particular commercial 
interest to the pharmaceutical industry. These proteins are all 
candidates for RSR evolution to improve function, such -^^^^"^ 
ending, shelf life, reduction of side effects through enhanced 
specificity, etc. All are well-suited to manipulation by. the 
techniques of the invention. Additional embodiments especially 
applicable to this list are described below. 

First, high throughput methods for expressing and 
purifying libraries of mutant proteins, similar to the methods 
des ribed above for Taq polymerase, are applied to the proteins of 
Table I. These mutants are screened for activity in a functional 
assay For example, mutants of IL2 are screened for resistance to, 
Tlasla or tissue proteases with retention of activity for the low 
affinity IL2 receptor but with loss of activity on the high 
, affinity IL2 receptor. The genes from mutants with -P^^^f 

activity relative to wild-type are recovered, and subjected to RSR 

to improve the phenotype further. . , .,s«r^lav 

Preferably, the libraries are generated in a display 
format such that the mature folded protein is physically linked to 
, he genetic information that encodes it. Examples include phage 
display using filamentous phage (O-Neil et al.. ^^^^ 
5 ! !I49 (1^95)) or bacteriophage lan^da gene V display (Dunn. J. 
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MoT Biol.. 248:497-506 (1995)). peptides on plasmids (Gates et 
al , -T Mnl. Biol. 255:373-386 (1995)) where the polypeptide of 
interest is fused to a lac headpiece dimer and the nascent 
translation product binds to a lac operator site encoded on the 
plasmid or PGR product, and polysome display (Mattheakis et al.. 

Acad m.S.A.) 91:9022-9026 (1994)) where 

ribosomes are stalled on mRNA molecules such that the nascent ■ 
polypeptide is exposed for interaction with cognate ligands 
without disrupting the stalled ribosome/mRNA complex. Selected 
complexes are subjected to RT-PCR to recover the genes. 

When so displayed, affinity binding of the recombinant 
phage is often done using a receptor for the protein of interest, 
in some cases it is impractical to obtain purified receptor with 
retention of all desired biological characteristics (for example. 
7-transmembrane (7-™) receptors) . In such cases, one could use 
cells expressing the receptor as the panning substrate. For 
example. Barry et al. (Nat. Med. 2:299-305 (1996)) have described 
successful panning of M13 libraries against whole cells to obtain 
phage that bind to the cells expressing a receptor of interest. 
This format could be generally applied to any of the proteins 

listed in Table I. ' ^ 

in some embodiments, the following method can be used 
for selection. A lysate of phage encoding IFN alpha mutants, for 
example, can be used directly at suitable dilution to stimulate 

5 cells with a GFP reporter construct (Crameri et al.. Nat. Med. . 
14-315-319 (1996)) under the control of an IFN responsive 
promoter, such as an MHC class I promoter. Phage remaining 
attached after stimulation, expression and FACS purification of 
the responsive cells, can be purified by FACS. Preferably the 

0 brightest cells are collected. The phage are collected and their 
DNA subjected to RSR until the level of desired improvement is 

achieved. ^ 

Thus, for example. IL-3 is prepared m one of these 
display formats and subjected to RSR to evolve ■ an agonist with a 
,5 desired level of activity. A library of IL3 mutants on a 
filamentous phage vector is created and affinity selected 
("panned") against purified IL3 receptor to obtain mutants with 
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improved affinity. The mutant IL-3 genes are recovered by PGR, 
subjected to RSR, and recloned into the display vector. The cycle 
is repeated until the desired affinity or agonist activity is. 
achieved. 

5 Many proteins of interest are expressed as dimers or 

higher order multimeric forms. In some embodiments, the display 
formats descibed above preferentially are applied to a. single 
chain version of the protein. Mutagenesis, such as RSR, can be 
used in these display formats to evolve improved single chain 
10 derivatives of multimeric factors which initially have low but 
detectable activity. This strategy is described in more detail 
below. 

H. Whole Cell Selections 

15 In some embodiments, the eukaryotic cell is the unit of 

biological selection. The following general protocol can be used 
to apply RSR to the improvement of proteins using eukaryotic cells 
as the unit of selection: (1) transf action of libraries of 
mutants into a suitable host cell, (2) expression of the encoded 

20 gene product (s) either transiently or stably, (3) functional 

selection for cells with an improved phenotype {expression of a 
receptor with improved affinity for a target ligand; viral 
resistance, etc., (4) recovery of the mutant genes by, for 
example, PGR followed by preparation of HIRT supernatants with 

25 subsequent tranf ormation of B. coli, (5) RSR and (6). repetition of 
steps (1) - (5) until the desired degree of improvement is 
achieved. 

For example, previous work has shown that one can use 
mammalian surface display to functionally select cells expressing 

30 cloned genes, such as using an antibody to clone the gene for an 
expressed surface protein (Reviewed by Seed, Curr. Opin. 
Biotechnol . 6:567-573 (1995)). Briefly, cells are transiently 
transfected with libraries of cloned genes residing, on replicating 
episomal vectors. An antibody directed against the protein of 

35 interest {whose gene one wishes to clone) is immobilized on a 
solid surface such as a plastic dish, and the transfected cells 
expressing the protein of interest are affinity selected. 



66 



For e:<an,ple, the affinity of an antibody for a ligand 
can be i»,proved using manunalian surface display and RSR 
^uLdies with higher affinity for their cognate ligands are then 
screened for improvement of one or .or. of the £o low.ng 
properties, ,1) improved therapeutic properties (xncreased cell 
killing, neutralization of ligands, activation of signal 
tra«=du;tion pathways by =rosslin.ing receptors,, ,2. improved in 
vivo imaging applications (detection of the antibody by 
covalent/noncovalent binding of a radionuclide or any agent 
detectable outside of the body by noninvasive means, such as N«R> , 
1 improved analytical applications (ELISI. detection of proteins 
or small molecules) , and (4) improved catalysts (catalytic 
antibodies) . The methods described are general and can be 
extended to any receptor-ligand. pair of interest. A specific 
examnle is provided in the experimental section. 

L use of a one mutant sequence-one transfected cell 
protocol is a preferred design feature for RSR based protocols 
because the point is to use functional selection to identify 
^^talts 1th i^roved phenotypes and, if the transfection is not 
irln a "Clonal- fashion, the functional phenotype of any given 
cell is the result of the sum of many transfected sequences, 
protoplast fusion is one method to achieve this end, since each 
protoplast contains typically greater than 50 copies each of 
single plasmid variant. However, it is a relatively low ^ 
single p ^ ^ transf ectanta) , and it does 

efficiency process (about lu. lu v. _ o ,.=11 

not worlc well on some non-adherent cell lines such as » 
lis Retroviral vectors provide a second alternative, but they 
are limited in the size of acceptable insert '^^) and 

consistent, high expression levels are sometimes difficult to 
achieve. Random integration results in varying expression levels, 
.hu introducing noise and limiting one's ability to ai«inguish 
between improvements in the affinity of the mutant protein vs. 
increased expression. . related class of strategies ^ 
:ed effectively to achieve -one gene-one cell. DNA "ansfer and 
5 consistent expression levels for RSR is to use a -^"^ 
which contains a lox site and to introduce this into a host that 
Expresses ere recombinase, preferably transiently, and contains 
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one or .ore lox sites integrated into its gence, ^^^'^ 
the variability of integration sites (Rohlman et al. 

Biotech. 14:1562-1565 (1996)). ..^^^fect with limiting 

An alternative strategy xs to transfect wicn 
An aitern.* ^^.^g a 

concentrations of plasmid (x.e.. ^^^^ .3 case 

vector that can replicate in the target cells such - - 
With plasmids bearing SV40 origins — ° ^ 
This Strategy requires that either the host cell or the vec 

. a re^Ucation factor such as SV40 large T antxgen. 
supply a replcatxon 268:2917-2923 (1993)) describe a 

Northrup et al. ( j Bi o 1 . 

V. stable transfectant expressing SV40 large 1 

Xerr^e vjrrcLa .... ..^^rs ^.^.r. sv,o 

/format sav. co„.is.enUy highe. '"-^^ 7;™,;^ 
ae^nstrable plasMd .epUcation, as assay a by -"'-"^ 

T T-ransient expression U-e, non j-iiv-^^ 
. ai.astio„ y y ,1 display sel^c^ons 

^ar it .irAne cycU .i.e ana increases nu^e. o. 

„ .epUca.on .«o.s .ay .a. ^-^2^;^ Z TppU Jtf 
,„e«icie„txy in ao e c n e ir/ltan.s with i^rovea activity 
irr::n i:te.est. . .e„e.ic p^otoc. .o. evo.vin. 

^ :L::t"cr:s t.an..ectea Wit. cxonaa onto a 

.ector containing SV40 large T antigen, ^f/^^^ //^t^: 

.ene sue. as J. a .eiate --J^— 1 

. . rrssTa-e::.::: as l =-a - an sv. o i. 

1 ^ri Tvnicallv after 1-10 days of transient 
30 containing plasmid. Typically a 

e=,.ession. ^ to 

ir/ere^rt: i. — - — — 

improvement is obtained. 

35 

ir;;;;^!;;^;!!;^. -tant proteins are selectea or 
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screened based on their ability to exert a biological effect xn an 
autocrine fashion on the cell expressing the mutant protexn For 
example, a library of alpha interferon genes can be selected for 
induction of more potent or more specific antiviral activity as 
5 follows. A library of interferon alpha mutants is generated in a 
vector which allows for induction of expression (i.e. under 
control of a metallothionein promoter) and efficient secretion m 
a multiwell format (96-well for example) with one or a few 
independent clones per well. In .some embodiments, the promoter is 
10 not inducible, and may be constitutive. . . ^ . 

Expression of the cloned interferon genes is induced. 
The cells are challenged with a cytotoxic virus against which one 
wishes to evolve an optimized interferon (for example vesicular 
stomatitus virus or HIV) . Surviving cells are recovered. The 
15 cloned interferon genes are recovered by PGR amplification, 

subjected to RSR. and cloned back into the transfection -ctor and 
retransfected into the host cells. These steps are repeated until 
the desired level of antiviral activity is evolved. 

in some embodiments, the virus of interest is not • 
20 strongly cytotoxic, m this case a conditionally lethal gene 

such as herpes simplex virus thymidine kinase, is cloned into the 
virus and after challenge with virus and recovery, conditionally 
lethal selective conditions are applied to kill cells that are 
infected with virus. An example of a conditionally lethal gene is 
25 herpes TK, which becomes lethal upon treating cells expressing 
this gene with the thymidine analog acyclovir. In some 
embodiments, the antiproliferative activity of the cloned 
interferons is selected by treating the cells with agents that 
kill dividing cells (for example, DNA alkylating agents) . 
,0 in some embodiments, potent cytokines are selected by 

expressing and secreting a library of cytokines in cells that have 
GFP or another reporter under control of a promoter that is 
induced by the cytokine, such as the MHC class I promoter being 
induced by evolved variants of alpha interferon. The signal 
35 transduction pathway is configured such that the wild type 
cytokine to be evolved gives a weak but detectable signal. 



J. nr.-" S^"" . „ 

in some en^cdlments of the invention, proteine are 

evolved by RSR to have improved half life in ser^m A 

. „ h=,lf life is evolving the affinity of a 

method for improving half-lite is evo a . 

S protein of interest for a Ion, lived serum prote n. such as 
antibody or other abundant serum protein. Examples of how 
affinity for an antibody can enhance serum half life 
CO administration of IL2 and anti-1.2 antibodies «>>-^---- 
erum half -life and anti-tumor activity ''^^'^'T 

,„ icourtney et al., 28.223-232 (1994)). 

,0 (Courtney et ^ -^^^^f-^^;;^^,,^ ,erum proteins are serum 

albumin, immunoglobulins, lipoproteins, haptoglobin, fibrinogen, 
«ns£e rin, alpha-i antitrypsin, and alpha -2 '^""^^^^'^^ 

^ . a ThP PlasmaProteins F. Putnam, ed. ; 
.Boolittle, chap er , ^^^^^^ ^^ ^^^ p,«eins 

15 Academic ' ^^^'^^^''^.i^ the primary targets 

Ti: t" i~:es on therapeutic proteins .ucn 
rr alu 1 for the purpose of extending -"-"^^ ^ 

antibodies, the pr.ferre^ -at^^^ tl^^jTU^illli.e 
- rcrcelatio! Of the relevant target 

:;ftope (antibody V region usage between different individuals is 

^^-""nfndrnrrel-of the desired affinity are evolved by 

applying phage display, peptides "'^^^^ Z.T::Zr 

aisplay --"--;:;:tX":;Arorh:"ise defined region of 
mutagenize an existing binding ° terminus, 

the target protein, or append a peptide library to the 
the targe p functionally nondisruptive loop. 

C terminus, or internaiiy <■= ^. v.it nf. is 

in other embodiments of the invention, halt life is 
improved by derivatization with PEG, other polymer -"^^^at- or 
Tlf-life extending chemical moieties. These are ""^ 
methods for extending half -life of therapeutic proteins (R. 
"Ln aiH^-arnXmasolOB^ 27,290-306 (1994),- Smith et al. 

r.::;:^ t:errriv::iror;an^^ 
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affinity of the therapeutic protein for its receptor or ligand. 
RSR is used to discover alternative sites in the primary sequence 
that can be substituted with lysine or other appropriate residues 
for chemical or enzymatic conjugation with half- life; extending 
5 chemical moieties, and which result in proteins with maximal 
retention of biological activity. 

A preferred strategy is to express a library of mutants 
of the protein in a display format, derivatize the library with 
the agent of interest (i.e. PEG) 'using chemistry that does not 

10 biologically inactivate the display system, select based on 

affinity for the cognate receptor, PGR amplify the genes encoding 
the selected mutants, shuffle, reassemble, redone into the 
display format, and iterate until a mutant with the desired 
activity post modification is obtained. An alternative format is 

15 to express, purify and derivatize the mutants in a high throughput 
format, screen for mutants with optimized activity, recover the 
corresponding genes, subject the genes to RSR and repeat. 

In further embodiments of the invention, binding sites 
for target human proteins that are localized in particular tissues 

20 of interest are evolved by RSR. For example, an. interferon that 
localizes efficiently to the liver can be engineered to contain a 
binding site for a liver surface protein such as hepatocyte growth 
factor receptor. Analogously, one could evolve affinity for 
abundant epitopes on erythrocytes such as ABO blood antigens to 

25 localize a given protein to the blood stream. 

In further embodiments of the invention,, the protein of 
interest is evolved to have increased stability to proteases. For. 
example, the clinical use of IL2 is limited by serious side 
effects that are related to the need to administer high doseg. 

30 High doses are required due to the short half life (3-5 min, Lotze 
et al., JAMA 256 (22) : 3117-3124 (1986)) and the consequent need for 
high doses to maintain a therapeutic level of IL2. One of the 
factors contributing to short half- lives of therapeutic proteins 
is proteolysis by serum proteases. Cathepsin D, a major renal 

35 acid protease, is responsible for the degradation of IL2 in Balb/c 
mice (Ohnishi et al., Cancer Res. 50:1107-1112 (1990)). 
Furthermore, Ohnishi showed that treatment of Balb/c mice with 
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■ >,^v,^t«r of this protease., prolongs the half 
pepstatin. a potent .nh.b. or ofj^^ ,^p,o.ine- activated 
life of recombinant human IL2 ana auy 

Wn.r cell activity in t.is j/f^^^^,,^, XLa or 

Thus, ,,,, are r.sista„t to seru™ 

of the protein. -//^^/^ staining variants 

or kidney proteases is a preferrea 

with extended serum half lives. ubrary of the 

. A preferred protocol xs as follows. 

• interest is expressed m a dispj-ay y 

mutagenized protern of interest P „j ^ phage 

with a gene-distal -P^-^PJ ''.^ off by proteases, the 

display construct such „eated with 

epitope tag is lost) . The expr ^ „hole . human 

defined proteases or „ith ^''"^^^l^^^^ ^ ^ distal tag 

serum. Affinity ----^^^f^^drrdLg hiological function 
is performed. * -=°"^/;'""tor) is performed.: . Phage retaining 
.e.g., binding to «a recovered and 

the epitope tag (and hence ^ desired level 

subjected to RSR. The process is repeated 

of resistance, is _ procedure is performed in a 

In other embodiments, tne 

^ r>.A tn have increased shelf life., a j 
S interest - . ^/^^^ ^f;,, 3^ence encoding the protein of 

:::::::: "^.r:^^ — r :rr:s;''^"' 

. -I '^^1 T^H of for example, <6 or >o, 
,0 ions, nonphysioiogical pH of, t recovered from from 

lyophilization, • ;L subjected, to 

rare::.:"- the .ocess repeated until the 

desired level of Improvement is achieved. 

of the 
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substrate for evolution by RSR is preferably a 
invention, the substrate ^j^jn perforrilng asymetric 

..„,le Chain contruct -^^P"-^^^^^^^ ^,„,,,es 

mutagenesis on ° „„i„tion of such constructs 

i.portant new Pathways for -th«^ ^^^^^^^ ,„«™ulti.eric 

, that is not open to the .„ , ,,„,„„itimer will 

ruinrtTa: eVinH^^^^^^ each identical subunit. Xn 
resuic 1 , however, the domains can mutate 

single chain constructs, however, 

^^^-^-ZJ^^^Z...^^^ P-eins to Sin., cha. ^ 

tax a number of proteins. ainqle chain Fv's 

ohain -----7r.r:.r.;u;«:,ranr:his strategy ha. 
(Bird et al., Science 242. 42J *z ^ stability (Young at 

.3 .esultea in antiboaies --^-f-^^rsrs ^u "" P-eolysis 
al., ZEE^ "':"',^'o 2 (l.SS„. A functional single 

'^•'.^tr^oLdiL. has been constructea, shown to 
chain version of IL5, a nomuvi 

.ave affinity for the XLS receptor ^^^^J ^^^^^^^ 

,0 protein, and this -struct ^ been "--^ ^.^^..-^S^O 
mutagenesis of the dimer JJ^^ 

(X996)) . A single chain .^.e single 

activator has been made, and it has been. ^..^^.^r 
accivduu ^„c=-i«t-ant to plasminogen activator 

chain construct is more resistant to p 

1 i-han the native homodimer (Higazi eu 
25 inhibitor type 1 than the n ^.^^.^^ insulin-like growth 

87.3545-3549 (1996)). Finally, a sing 

,.ctor X/insuUn hybria has been ^^'^^TreZ;^^ ,,,, 
higher affinity fox chimeric xnsulin/IGF 1 rec P 

either natural ligana (Kristensen et al., 

" m general, a linker is constructea Which ioins the 

amino terminus of one subunit of a protein ^^^^^^^^ ^jXsi^ 
carboxyl terminus o£ another subunit in the con^lex. The 
carboxyi t imked versions of homodimers, 

proteins can consist of linked h.teron«iltlmers . In 

3S homomultimers. »«ero i»e o h^ H ^^^^^^^ ^^^^^^ ^^^^^^ 

rilftr^eT-r rsUi«nt variations.can.be .ade. 
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Pir=t, one can construct diverse libraries of 

wild type sequence in and around the junctrons and in the linkers 
facilitate the construction ol active fusion proteins. 

c:::: Lan, et ai., "■"^"-^":L:r:hi'n 

aescribod circular permutations of T4 lysozyme in wh ch the native 
amino and carboxyl «™ini have been joined and 
carboxyl termini have been engineered into the protein. The 
methods of circular permutation, libraries of linkers, and 
libraries of functional sequences flanking the Unkere aUo^ one 
to construct libraries that are diverse in topological linkage 
strata s and in primary sequence. These libraries 
and selected for activity. Any of the above mentioned strategies 
£oi screening or selection can be used, with phage display being 
pref rab e in most cases. Genes encoding active, fusion proteins 
fre recovered, mutagenlzed. reselected. and subjected to standard 
RSR protocols to optimize their function. Preferably, a _^ 
population of selected ™tant single chain constructs is PCR 
ailified in two seprate PCR reactions, such that each of the two 
ampiliieo , . . . , ,.„^„t,iv oligonucleotides are derived 

r L sepa.a.e.v a.pXifie^ acains ^iJ-^]^^ ^^^^^^ 
reactions, then the two populations are recon^xned -^^^^ 
reassenOoly to generate intact single chain constructs for further 
rounds of selection and evolution. 



y. 

A 



r^ ^^^.r.^ Prone r -^- Pb.rm.rPUti ral Proteins . 

r m rfl ^r-'^'-'^- Fnrnrfor or CpIT Type of 



Interest 

The majority of the proteins listed in Table I are 
either receptors or ligands of pharmaceutical interest. Many 
a on ts s^ch as chemokines or interleukins agonize more than one 
receptor. Evolved mutants with improved specificity may have^ 
a fed side effects due to their loss of activity on receptors 



which are implicatea in a particular side effect profile 
:Lt of these ligana/receptors. mutant forms with improved 
affinity would have improved pharmaceutical properties. F< 
IZTJ. 7. antagonistic form of ^ with improved affinity for 



For 
For 
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CKR5 should be an improved inhibitor of HIV infection by virtue of 
achieving greater receptor occupancy for a given dose of the drug. 
Using the selections and screens outlined above in combination 
with RSR, the affinities and specificities of any of the proteins 
5 listed in Table I can be improved. For example, the mammalian 

display format could be used to evolve TNF receptors with improved 
affinity for TNF. 

Other examples include evolved interferon alpha variants 
that arrest tumor cell proliferation but do not stimulate NK 

10 cells, IL2 variants that stimulate the low affinity IL2 receptor 
complex but not the high affinity receptor (or vice versa) , 
superantigens that stimulate only a subset of the V beta proteins 
recognized by the wild type protein" (preferably a single V beta), 
antagonistic forms of chemokines that specifically antagonize only 

15 a receptor of interest, antibodies with reduced cross-reactivity, 
and chimeric factors that specifically activate a particular 
receptor complex. As an example of this latter case, one could 
make chimeras between IL2 and IL4, 7, 9, or 15 that also can bind 
the IL2 receptor alpha, beta and. gamma chains (Theze et al., imm. 

20 Today 17:481-486 (1996)), and select for chimeras that retain 
binding for the intermediate affinity IL2 receptor complex on 
monocytes but have reduced affinity for the high affinity IL2 
alpha, beta, gamma receptor complex on activated T cells. 

B. Evolved Agonists with Increased Potency 

In some embodiments of the invention, a preferred 
strategy is the selection or screening for mutants with increased 
agonist activity using the whole cell formats described above, 
combined with RSR. For example, a library of mutants of IL3 is 
expressed in active form on phage as described by Gram et al. ( J. 
Immun. Meth. 161:169-176 (1993)). Clonal lysates resulting from 
infection with plaque-purified phage are prepared in a high 
through-put format such as a 96 -well microtiter format. An IL3- 
dependent cell line expressing a reporter gene such as GFP is 
stimulated with the phage lysates in a high throughput 96-well. 
Phage that result in positive signals at the greatest dilution of 
phage supernatants are recovered; alternatively, DNA encoding the 



30 



35 



75 



J K„ PCS in some embodiments, single 
.uatant IL3 can be recovered W PCR „,p„„ive promoter 

.ens ^-"-f ^::rr.3";;:;e U-rv. and t^e positive ..C 
can stimulated wxth to PCR, «.d the 

::rc::s rrerdrtirriled ..l o. i.ro.em.nt is 



obtained. 



Table I 



10 Name 

Alpha- 1 antitrypsin 

Angiostatin 

Antihemolytic factor 

Apolipoprotein 

15 Apoprotein 

Atrial natriuretic factor 
Atrial natriuretic polypeptide 

Atrial peptides 3.,,.,, Q^o-b. Gro- 

C-X-C chemokines (e.g.. 
20 c, IP-10. GCP-2. NAP-4, SDF-1. PF4. MIG) 

calcitonin chetnoattractant protein^l. Monocyte 

cc che.oKines e Mo- y^_^^ ^^^^^^^^ ..e.oattractant protein- 
chetnoattractant P alpha, Monocyte 

HCCl. T58847. D31065. T64262) 
Cb40 ligand 
Collagen 

colony stimulating factor (CSF) 
30 Complement factor 5a 
Complement inhibitor 
Complement receptor 1 
Factor IX 
Factor VII 
35 Factor VIII 
Factor X 
Fibrinogen 
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Fibronectin 
Glucocerebrosidase 

Gonadotropin 

Hedgehog proteins (e.g., Sonic. Indian, Desert) 
S Hemoglobin (for blood substitute,- for radiosensitization) 

Hirudin 

Human serum albumin 
Lactoferrin 
Luciferase 
10 Neurturin 

Neutrophil inhibitory factor (NIF) 
Osteogenic protein 

Parathyroid hormone 

Protein A 
15 Protein G 

Relaxin 

Renin 

Salmon calcitonin 
salmon growth hormone 
20 soluble complement receptor I 

Soluble I-CAM 1 o a ^ 6 7 9, 10, 11, 

soluble interleukin receptors (IL-1, 2. 3, 4, 5, 6, 1. 

12, 13, 14, 15) 
Soluble TNF receptor 
25 Somatomedin 
Somatostatin 
Somatotropin 

streptokinase „3„hylococcal enterotoxlns (SEA, SEB, SECl, 

^ • TV ,r,^ n Pvroaenic exotoxins A, B, ana l., 

Exfoliating toxins A and B, Pyrogenic 

and M. arthritidis mitogen 
Superoxide dismutase 
Thymosin alpha 1 
35 Tissue plasminogen activator 

Tumor necrosis factor beta (TNF beta) 
Tumor necrosis factor receptor (TOFR) 



0 
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Tumor necrosis factor-alpha (TOF alpha) 
Urokinase 

C nf Co mponPTif B of Eukaryotic Signal 

;;~;g the screens and Illl^^ons listed above, RSR can 
. ™«^^f^7 Pukarvotic signal transduction 
be used in several ways to modify eukaryocic ^ 
or transcriptional pathways. Any component of a signal 
or transctipuj. ^ ^v,^ -rpaulatorv regions and 

chemicals that induce transcription can be f 

^^^^^ 

optimization of diverse assays validated 

=™r,iP dozens of 7 transmembrane receptors (7-TM) are a 
example, dozens oi ^^amole Siderovski et al., 

targets for drug discovery (see. for ^^^"^P^^' 

u.ol 6(2) •211-212 (1996); An et al . , FF.PS Lett. , 375 (1 
Cj^rr^iSl^. 6(2).211 ^ 163 (2) : 295-299 (1995) ; 

2) -121-124 (1995); Raport et al.. Gene, ^ ' 

, oo (2) -347-349 (1995) ; Strader et al. FASia 

song et al.. Genomics, 28(2) 347 ^ 363 (i-2) :49-52 

T q (9) -745-754 (1995); Benka et al . , FF.BS Lecc ^. 
n;95 sptegel. o^^lmE^^ ^ 

1 , .niLN^A..^^' 730:168-175 (1996); Spiegel. 
^r7777^;;:^l996) ; Barak et al. . B^ssto^. 

407-15414 (1995) ; and Shenker, Milli^^^^^ 
34 (47). ib^ or-^v. 427-451 (1995)). The development of 

these receptors is essential for expioic g 

combinatorial chemistry in ^^^l°:'^^2r7t T:Z.n. chemicals 
Additionally, biodetectors or biosensors £or ^^"^ 

can be developed by evolvin, 7.™.s '^^^^'^XZcX^^^^^on 
„o.el Chemicals or proteins o. .nter . ^ ^ J^^^ 

"""" ".rtlteTe c re lin. could be done singly with 

polypeptide '"Jl^^^l'; ^^^^^^ ,,,, since the desired 

fluorescence or lignt: activate 
improvement is coupled to light production. 
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In addition to detection of small molecules such as 
pharmaceutical drugs and environmental pollutants, biosensors can 
be developed that will respond to .any chemical for which there are 
receptors, or for which receptors can be evolved by recursive 
5 sequence recombination, such as hormones, growth factors, metals 
and drugs. The receptors may be intracellular and direct 
activators of transcription, or. they may be. membrane bound 
receptors that activate transcription of the signal indirectly, 
for example by a phosphorylationo cascade. They may also not act 

10 on transcription at all, but may produce a signal by some post - 
transcriptional modification of a component of the signal 
generating pathway. These receptors may also be generated by 
fusing domains responsible for binding different ligands with 
different signalling domains. Again, recursive sequence 

15 recombination can be used to increase the amplitude of the signal 
generated to optimize expression and functioning of chimeric 
receptors, and to alter. the specificity of the chemicals detected 
by the receptor. 

For example, G proteins can be evolved to efficiently 

20 couple mammalian 7-TM receptors to yeast signal transduction 

pathways. There are 23 presently known G alpha protein loci in 
mammals which can be grouped by sequence and functional similarity 
into four groups, Gs (Gna, Gnal) , Gi (Gnai-2, Gnai-S, Gnai-1, 
Gnao, Gnat-1, Gnat-2, Gnaz) , Gq (Gnaq, Gna-11, Gna-14, Gna-15) and 

25 G12 (Gna-12, Gna-13) (B. Nurnberg et al., J . Mol . Med . . 73:123-132 
(1995)). They possess an endogenous GTP^ase activity allowing 
reversible functional coupling between ligand-bound receptors and 
downstream effectors such as enzymes and ion channels. G alpha 
proteins are complexed noncovalently with G beta and G gamma 

30 proteins as well as to their cognate 7-TM receptor (s) . Receptor 
and signal specificity are controlled by the particular 
combination of G alpha, G beta (of which there are five known 
loci) and G gamma (seven known loci) subunits. Activation of the 
heterotrimeric complex by ligand bound receptor results in 

35 dissociation of the complex into G alpha monomers and G beta, 
gamma dimers which then transmit signals by associating with 
downstream effector proteins. The G alpha subunit is believed to 
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be the subunit that contacts the 7-TM, and thus it is a focal 
point for the evolution of chimeric or evolved G alpha subunits 
that can transmit signals from mammalian 7-TM's to yeast 
downstream genes, 
5 Yeast based bioassays for mammalian receptors will 

greatly facilitate the discovery of novel ligands. Kang et al. 
(Mol. Cell Biol. 10:2582-2590 (1990) ) have described the partial 
complementation of yeast strains bearing mutations in SCGl (GPAl) , 
a homologue of the alpha subunits of G proteins involved in signal 

10 transduction in mammalian cells, by mammalian and hybrid 

yeast /mammalian G alpha proteins • These hybrids have partial 
function, such as complementing the growth defect in scgl strains, 
but do not allow mating and hence do not fully complement function 
in the pheromone signal transduction pathway. Price et al. ( Mol. 

15 Cell Biol. 15:6188-6195 (1995)) have expressed rat somatostatin 
receptor subtype 2 (SSTR2) in yeast and demonstrated transmission 
of ligand binding signals by this 7-TM receptor through yeast and 
chimeric mammalian/yeast G alpha subunits ("coupling") to a HISS 
reporter gene, under control of the pheromone responsive promoter 

20 FUS-1 enabling otherwise HIS3(-) cells to grow on minimal medium 
lacking histidine. 

Such strains are useful as reporter strains for 
mammalian receptors, but suffer from important limitations as 
exemplified by the study of Kang et al., where there appears to be 

25 a block in the transmission of signals from the yeast pheromone 
receptors to the mammalian G proteins. In general, to couple a 
mammalian 7-TM receptor to yeast signal transduction pathways one 
couples the mammalian receptor to yeast, mammalian, or chimeric G 
alpha proteins, and these will in turn productively interact with 

30 downstream components in the pathway to induce expression of a 
pheromone responsive promoter such as FUS-1. Such functional 
reconstitution is commonly referred to as "coupling" . 

The methods described herein can be used to evolve the 
coupling of mammalian 7-TM receptors to yeast signal transduction 

35 pathways. A typical approach is as follows: (1) clone a 7-TM of 
interest into a yeast strain with a modified pheromone response 
pathway similar to that described by Price (e.g., strains 
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deficient in FARl, a negative regulator of cyclins, and. 
deficient in SST2 which causes the cells to be hypersensitive to 
the presence of pheromone) , (2) construct libraries of chimeras 
between the mammalian G alpha protein (s) known or thought to 
5 interact with the GPAl or homologous yeast G alpha proteins, (3) 
place a . selectable reporter gene such as HISS under control of the 
pheromone responsive promoter FUSl (Price et al., Mol. Cell Biol, 
15:6188*6195 (1995)). Alternatively, a screenable gene such as 
luciferase may be placed under the control of the FUSl promoter; 

10 (4) transform library (2) into strain (3) (HIS(-)), (5) screen or 
select for expression of the reporter in response to the ligand of 
interest, for example by growing the library of transf ormants on- 
minimal plates in the presence of ligand to demand HIS3 
expression, (6) recover the selected cells, arid and apply RSR to 

15 evolve improved expression of the reporter under the control of 
the pheromone responsive promoter FUSl . 

A second important consideration in evolving strains 
with optimized reporter constructs for signal transduction 
pathways of interest is optimizing the signal to noise ratio (the 

20 ratio of gene expression under inducing vs noninducing 

conditions) . Many 7>TM pathways are leaky such that the maximal 
induction of a typical reporter gene is 5 to 10-fold over 
background. This range of signal to noise may be insufficient to 
detect small effects in many high through put assays • Therefore, 

25 it is of interest to couple the 7-TM pathway to a second nonlinear 
amplification system that is tuned to be below but near the 
threshold of activation in the uninduced state. An example of a 
nonlinear amplification system is expression of genes driven by 
the lambda promoter. Complex cooperative interactions between 

30 lambda repressor bound at three adjacent sites in the cl promoter, 
result in very efficient repression above a certain concentration 
of repressor. Below a critical threshold dramatic induction is 
seen and there is a window within which a small decrease in 
repressor concentration leads to a large increase in gene 

35 expression (Ptashne, A Genetic Switch: Phage Lambda and Higher 
Organisms , Blackwell Scientific Publ . Cambridge, MA, 1992). 
Analogous effects are seen for some eukaryotic promoters such as 
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those regulated by GAL4. Placing the expression of a limiting 
component of a transcription factor for such a promoter (GAL4) 
under the control of a GAL4 enhanced 7-TM responsive promoter 
results in small levels of induction of the 7-TM pathway signal 
5 being amplified to a much larger change in the expression of a 
reporter construct also under the control of a ,GAL4 dependent 
promoter. 

An example of such a coupled system is to place GAL4 
under control of the FUS-1 pheromone responsive promoter and to 

10 have the intracellular GAL4 (itself a transcriptional enhancer) 

level positively feedback on itself by placing a GAL4 binding site 
upstream of the FUS-1 promoter.' A reporter gene is also put under 
the control of a GAL4 activated promoter. This system is designed 
so that GAL4 expression will nonlinearly self -amplify and co- 

15 amplify expression of a reporter gene such as luciferase upon 
reaching a certain threshold in the cell. RSR can be used to 
great advantage to evolve reporter constructs with the desired 
signaling properties, as follows: (1) A single plasmid construct 
is made which contains both the GAL4 /pheromone pathway regulated 

20 GAL4 gene and the GAL4 regulated reporter gene. (2) This 

construct is mutagenized and transformed into the appropriately 
engineered yeast strain expressing a 7-TM and chimeric 
yeast/mammalian protein of interest. (3) Cells are stimulated 
with agonists and screened (or selected) based on the activity of 

25 the reporter gene. -In a preferred format, luciferase is the 
reporter gene and activity is quantitated before and after 
stimulation with the agonist, thus allowing for a quantitative 
measurement of signal to noise for each colony. (4) Cells with 
improved reporter properties are recovered, the constructs are 

30 shuffled, and RSR is applied to further evolve the plasmid to give 
optimal signal noise characteristics. 

These approaches are general and illustrate how any 
component of a signal transduction pathway or transcription factor 
could be evolved using RSR and the screens arid selections 

35 described above. For example, these specific methods could be 

used to evolve 7-TM receptors with specificity for novel ligands, 
specificity of nuclear receptors for novel ligands (for example to 
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genes of Interest in transgenic Pl-»"' ^gent) , 

Us can be inaucea r sponsive to vir.l 

speciticity of transcrxptaon (actors to ^^^^^ 

S factors (t.us inducing -""^^ //J ^aL^enics or cells treated- 
e,cpressing this ' ^icity of transcription 

„lth gene therapy """""f ' / ,,,„ple p53. deficient . ^ 
factors for activity in „^ .^^^py constructs 

cells, thus allowing one " \ 3p,cif ic 

10 expressing conditionally lethal genes 

fashion) . offered by way of 

The following examples are offerea cy 

r,o«- hv wav of limitation, 
illustration, not by way ^ 

^ P-ferred ^t-t^J ^^^^ oligonucleotides 
codon usage libary xs -structe ^^^^^ ^^^^^^^^^ 

..ch that the centr^^^^^^^^^^ ^^^^^^^ .i.. 

20 type protein, but encoa ^^^^^^ ^^^^ 

degenerate codons. are not used. The 20 

„tic -/rron:: use nl-degenerate, but preferred 

,3ses at each end ^^^^^ ^^..^...es are assen^led into full- 
codons in E. coU. The g ^^g^^ied products are 

25 length genes as described above^ T ^ ^^^^ 

cloned into an expression -J^- J ^3 expressed 

art. in some embodiments, the codon y ^.^^ 
With a library of secretory ^^^^^Zn^i.^^^- ^ 
airects the encoded BIAP protean - ^^^^^^^^^ con^ination of 
30 library of leader sequences is used P^^^^^^ ^^^^^^^^^ 

leader sec^ence and -^^^ ' J^^^ 24:215-243 (1990) ) . 
reviewed by Schatz et al ^^^^^^^..^ control of an 
The cloned BIAP ^-^/J^^/Xe :abinose promoter.. Arabinose- 
inducible promoter such as the a ^ ^.^strate for 

35 induced colonies are screened ^^"^ ^ ^.^^est colonies 
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herein. «„„^-mlr'^ion of the codqn usage 

The oligonucleotides for construction or t 

. T«ble II The corresponding locations of 

library are listed in Table II. 
these promoters is provided in Figure 1. 



Table II 



5. 
6 . 

TTGAAGAGGA 



, ;*CCCTC»0 TTCCMCCC CT.™ TC.CCCT3C0 
\' «CCCTCCAG TTCCG^ACCC CATATG«^ AAACCGCT 
,■ AiCCCTCCAG TTCCGMCCC MMACATAT GCGTGCTMA 

: ^™g .tccgmccc , 

.ACCCTCCAG TTCCGMCCC GOTATACATA TCARACAGTC 

TCTGCTCAGG C.ATGGCOGT ^CA^ C^<='^ 

gcaccg™ OCAKGCPGC OCCAAKGA. CTGGTTCCGO 
, . ™::^C GCXATCTCTT CVTTVGCTC .ACGG^ -GOTTCCOG 
, iTc^ -CAGCCGG COATGGC.AT GOA^. CTGGXTCCGG 
,0. ™S O^CACCCCG G..ACVAAHG CBGCCAKG. OCTGGTTCCG 
TGGAACCGTC AKGC^^GCCA RGCCT^- ™AAAA 

. ^0^™ -™ -™ """"" 

,3 ^^^S^G^C CGGAAAC^C GAVCA^TTVC CO.ACG..GC 

Src^Tc x™a cgcacvgc .™^c gg™ 

° S^™A CAACCAGTGC AAKACVACVC G.GG™ ^-TACCTC. 

:™^™™cACVA™cA.cA.Ga..crccoGC^ 

3, Z:::^C GC^CC^GC c™.C :>CAKATGAAC GGT.GCCAOG 

.e. TATCCTGGGT GGVGGVCG.A AHTAVA^- CCCGG.TOGT 
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r« OTOXTC.^ ^VCTOCTtC .OOC^O- 

3„ r« »oa.c^ vcvxc^cr otxxccccc 

~=T OCOC^TO 
CTTTCT^CCT O.X.O™- . TCC.CC*CO. 

,3 ™^ m;^v.c..x «CT 

CTCCCTT OV«.™V.C CCXOC.TOCT 

33 ^^^^T CX^-- CTOX^VCOV. 

TCX.CCXCX. ™VC»C^ C.O.CXCCXC, 

„ ™™ C^OCVCCC »™CX OOTXC.C«.X 

,3 :~c: CXOOT^ ~« -CXOCC«=C 

cccccocx oc.™ 

3„ Src: — CX x™O.C XHOOHCCO.. ^-OC 
3. r^cS: rxO.OC.OCO x™. -XC«.^^^ 
3. O^^^OCCXO .XKXOHOCH. C«C X^O^- 

3„ 33 :™ro a.coxcrrc.ccHca,xo. oxvxc.0^0 ccxocoo^c 

3, ::"o XCOXX«.COX CHOO.CO.O. .CCCCCCC .O.OCOX..C 
3, OXCC.O.OCX XX— CCOHCC... 0.X.O.0OX. 

3e SSSTx OOXCOCOOX h.cc.ok.xc .oKoxaxcc oxxcccox 



35 

36. 

CAGTTCGTTA 
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3,. c^CTACCA GCTTCGGTCA GHGCCTRT* HGCY™ TCG«K.,^C 
3a. rG"c acgcggg™ COKGAHACHA CHCGCACHOC aacttcg-^c 
3,. 'rc^GrT^ CGTTGTAm CATRTCHGCH GG^CHAACA GACCCATCAG 
GTACGGTTCC A».™ HGCKCCVTCO TGTTTAGCCT 
„ . taS^STc G^AACAGAA GCRTCTCHG GBTAVTCHGG OTCCGGGGTA 
'cS^™ CGTCGATGTC CATRTT.™ ACCAG.TGHG CAGCGATGTC 
,3 TCAGAGTACC AKTTRCORTT HACKGTRTGA GCGTAAGCAC 

™cL^ ACCAACAGA. ^CCHGCVT TV^GCHCG ^CAXAACA 

«. "S'^St aacgagcagc hgcrgahach ccratkgtrc ggtagttacc 
„ IT^'^g tccggaacct gkccrtchac rttrtakg^ ttagacagag 

^™ a™a GA.A03GG.A 

.S. GGA.AACGTT VTTHGCHGCR GT^TGRA^G GCTGCAGXT. 

TTTAGCAACG 

^r-fiGTTCCAG AAAGCCGGGT CTTCOTCTTC AACCGGAACC AG 
1 cSrGCrA CATAACACCA GCHGCHACHG CHACHGCCAC COGCAGTTTA 

S. "ZTZl ACAGCAGCGG CAGCAGHGCC AGHGCRATRG TRGAC«.XTT 
, "gCCAGCAG CAGCAGRCCH GCHGCHGCGC .CGGCAGCAO 

,3. Z^C GATCGGGGTG GTCAOHACRA TRCCCAGCAG TTTAGCACGC 
GAAACCAGCC AGHOCHACHG CRATHGCRAT AOCGGTTTrT 

c TTCATATG 

55 AGAATTCTCT AGAGGCGGAA ACTCTCCAAC TCCCAGGTT 
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56. TGAGAGGTTG AGGGTCCAAT TGGGAGGTCA AGGCTTGGG 

: rrmT^"" The code for degenerate 

positions is: R: A or G. Y. or 1. 



5 T. 



10 



15 



20 



25 



II. Hasfflllisa^S=fSS«Ji«=^ antibodies naturally undergo a 

c:::diri:u= .r t.eir j-^^^^^^^^^^^^ 

driven by somatic hyper^tation ^^^^^ ,,;,3.,, 

.lonal selection .Bere. and m ste.n 

„,S7)). patten et al. "„„iya= antibody fro» the 

reconstructed t.e -J ;.: :rop>.enylphospho„ate hapten 

germline sequence, which binds P .jjinity matured 

""•^ - ^^rL:^:::-"rnrs;::r^^^^^^^^ - 

rarrnify o^ rnLllar. «.e a«inity ^turation o.^this^ 
antibody can be recapitulated and impro y ^ 

with PCR) , 

mutagenesis o. the J-jT^.T^^^^^ " - ''^'^ " 

„a-aUan display, FACS select ^^.^^^^ ,„,oding 

rapidly evolve improved af£inicy oy 

improved binding. =v„^Me vectors similar to 

oenomic C^^LIl^-^ 

,.„se --^f-;^^;;;:;: ; re ;o:=r;::^;r;^«r^ 

l!L.S^ 84.2936-2940 (13 ) _^ ^^^^ ^^^^^^^ 

„™tant V region axons can be readily ^^^^^.^^ 

.actors. -r--=;::rrh: h::v: h"n Ts cloLd onto a neomycin 
puromycin resistance and the heavy ^ ^^^^ ^^^.^^ 

, resistance ^"^^^''^ ZZ.J^^ 7r^^^''^ " 
se^ences ^-^^^^^j^^J^^trmutagenesis into genomic exons 

"^'T. Tsfi I "s with complementary Sfi I sites placed at 
flanked by Sfi I sites wi j shuttle vectors. The 

..e appropriate ^^^^^ ^Z:::":.. X Sites nankin, 
5 oligonucleotides used t° create ^^^^^^^ tACATOGCCG AAGGGGCCGT 
the VDJ exon are: 5' Sfi i. ^ CATGTTGGCC TCCTTGGCCG 

GCCATGAGGATTTT-3'; 3 ' Sf i 1 : 5 ' -TTCTAAATG 
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OATTCTGAGC CTTCAGOACC A-S'. Standard PCR '^"^'"'l^' ^^"^^^ ",1 

applied to produce libraries of mutants -herein the following set. 
Tr sidues .nu^ered according to Kahat, o. ^ , .,.., of proteia ^ 

, T.terest. U.S. Dept of Health and Hu^ Services, 

1S91) are randomized to NHK codons (GRTG.GATC.GC) : 



Chain 


CDR 


Mutated residues 


V-L 


1 


30, 31, 34 


V-L 


2 


52, 53„ 55 


V-H 


2 


55, 56, 65 


V-H 


II ^ II 


74, 76, 78 



stable transfectant lines .re mad. for each of the two 
light and heavy chain constructs (mature and germlin.) using the 
In myeloma 7g8-.53 (a gift from Kearney, as a host using 

anda'rd electroporation protocols, libraries of ~ ^ ^jf 
encoding the indicated libraries of V-L mutants 
into the stable transformant expressing the germline V-H, and 
V-H mutants are transfected into the germline V-L 
transfectant line. In both cases, the libraries are «"°<^"«^/>' 

protoplast fusion <Sa*roo. et al., '^^^^^'^^'^'f^J^^flJZZ 

fh^^ ma-ioritv of transfected cells recexve 
(1987)) to ensure that the ma]oriuy ^j- ^ ^ . ^.v.^ 

" all only one mutant plasmid ^^-'^^^ .^"^'f^'^ZlTS 
case for electroporation where the majority °* "^^f/f^^^ 
cells would receive many plasmids, each expressing a different 

^tant -^-«'.;_^„„^,,„^,,,,^,„„«e hapten recognized 
by this antibody is synthesized as described by Patten et al^ 
Science 271:loe6.10» (1556,1 . is coupled directly to 

"^minoethyllthio, acetyl, fluorescein . f^' 

L ormation of an amide bond using a standard -"^^-^ ^-^^^'-^ 

such as E^C ,«arch, ^M^^^^-^-^^^^^^'J''^^^^^^ 
achn Wiley and Sons, 1985, to give a monomeric 
A "dimeric. conjugate (two molecules of coupled to a FACS 

marker, is made in order to get a higher avidity probe, thus 
laLng low affinity interactions (suoh as with the germ ine 
antibody, more readily detected by FACS. This is generated by 



10 



15 



20 
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structure of IgG then pro ™j.i-FITC molecules that are 

spin coluM, is used to ren^ve excess J „,aa 

„ct .ouna to tHe ^^^^^^^^^^ ^^u. .UH e..C to two 

'°"rts ofeti:^ le faline, and this is then .e couple, to the 
equivalents o£ ^"yl bi^tiylatad JWJ-1 product is 

j„e J! toeraphy and characterised by .ass 

purified by iXlled avidin is incuhated with the 

spectrometry. FITC laoeiie tetravalent probe, 

biotinylated in order to generate a «trav^^ P ^^^^ 

.he selection IS performed ^'^^^^^^^ 

a protocol similar to that of Pan.a ^^J^^ ,,^,,,i,s of 

S5-.30S0-30S4 (1986,). ^"/^'""f '"rtusion (with 

^--11 ^7^1::: r-: rrrated ice with 

Tirfcriy^la^- hapten. . .e ^^^J: ^^l^^^^^ 
„i„i„i.e pinocytosis Of -^rj^^ed In the P.CS . 

nonspecific background. Ihe cells ^ p^csing without- a washing 
either with or without a washing step FACSing ^ ^^^^^^ 
step is preferable because the off rate for t^e g ^^^ ^ 

" -""TarTcrr ^--rio:: us.en , > washing 

ridirir^Uca^St^Iable. .he brightest 0.1 - 10. of the 

i cells are collected. ^,„it,ulated to optimize the 

Four parameters are manipulatea to f 

selection for increased binding, ™ric - -""—^^.^^^^ 

tetrameric hapten,. 'j;/ jjinity Kd's,, time 

reaction (low concentration selects for high ^^^^^^ ^ 

0 between washing and -longer '■^^J^^^; „ore 

"-"--^l; r:,r"l CO strfcts expressing the germline, 
rt^iniattHorinations of half germline are used as 
controls to optimize this a-lactivity. 

Plasmids are recovered from the FACS sei 
35 Plasmio ^.^^ supematants. 

the transformation of an E. coll pcR-amplif led from the 

Alternatively, the mutant V gene exons are PCR P. 
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FACS selected cells. The recovered V gene exons are subjected to 
. RSR, recloned into the corresponding genomic shuttle vector, and 
the procedure recursively applied until the mean fluorescence 
intensity has increased. A relevant positive control for improved 
5 binding is transfection with the affinity matured 48G7 exons 
(Patten. et al., op. cit.). 

In a further experiment, equal numbers of germline and 
each of the two half germline transf ectants are mixed. The 
brightest cells are selected under conditions described above. 

10 The V genes are recovered by PGR, recloned into expression 
vectors, and co- transf ected, either two plasmids per E. coli 
followed by protoplast fusion, or by bulk electroporatiOn*. The 
mean fluorescent intensity of the transf ectants should increase 
due to enrichment of mature relative to germline V regions. 

15 This methodology can be applied to evolve any receptor- 

ligand or binding partner interaction. Natural expression formats 
can be used to express libraries of mutants of any receptor for 
which one wants to improve the affinity for the natural or novel 
ligands. Typical examples would be improvement of the affinity of 

20 T cell receptors for ligands of interest (i.e. MHC/tumor peptide 
antigen complexes) or TNF receptor for TNF (soluble forms of TNF 
receptors are used therapeutically to neutralize TNF activity) . 

This format can also be used to select for mutant forms 
of ligands by expressing the ligand in a membrane bound form with 

25 an engineered membrane anchor by a strategy analogous to that of 
Wettstein et al. ( J. Exp. Med. 174:219-28 (1991)). FACS selection 
is then performed with f liaorescently labelled receptor. In this 
format one could, for example, evolve improved receptor 
antagonists from naturally occurring receptor antagonists (ILl 

30 receptor antagonist, for example) . Mutant forms of agonists with 
improved affinity for their cognate receptors could also be 
evolved in this format. These mutants would be candidates for 
improved agonists or potent receptor antagonists, analogous to 
reported antagonistic mutant forms of IL3. 

35 

III. Evolution of Alpha Interferon 

There are at hand 18 known non- allelic human 
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1 v,= n^F-cy) qenes, with highly related primary 
interferon- alpha (INF-a) genes, ^ biological 

/-TR q^'- identical) and with a broad range oe u±» s 
structures (78-95* idencicci ; interesting biological 

activities Many hybrid interferons with interesting 
activities My y p^^ental molecules have been 

activities differing rrora u«c f Ther 

, ^ v,v Horisberger and Di Marco, Pharm. Ther ^ 

described (reviewed by Horisberg interferon, IFN-Conl, 

fiqqsn A consensus human alpha interrexo , 

ccu,pares favorably «th -^"^ ^ ^ ^^^3 containa 20 

et al.. UntfiSf^SSTL^ ' '™ ja the IBF-^ w which it. la 

.^„o acid :;;^",-!, :,Tar Ifofa hi,h« speciUc 

most closely related. IFN Conl subtype. IFN-a Conl . 

antiviral activity than any known natural IFN 

antivira .^^ to 20 fold higher than that of 

has in vitro activities lu r-iinicallv) in antiviral. 

w .r,^ TFN a-2a (the major IFN used clinicaiiyj 
recombinant IFN a ^^""^ ^ . t-here is 

.fl Mif cell activation. Thus, tnere 
,„UproUfera..ve ana ^ ^^^/^^^ ,,,,,.3 which 

considerable interest in p interferons, 
combine the most desirable traits 3 ^he 

However, .ivan the enormous number f POte t a ^y^_^ ^^^^^^^^^ 
lacK Of a crystal -t-cture/f IFK " ^ ^ ^^^.^as 
there is a perceived impasse in '"^ J^^^^/^ ,„35), . 

alver a. 

^ :rin:rh: rnrction of ciass i ana <^2z::^.^t::^ 

Of monocytes and macrophases, "synthesis in B 

activation of cytotoxic T cells, moaulation of Ig synt 

cells, --7;-:3t™:"- subtypes have uni<.e spectra of 
" activities on alfferenf target cells ^-f^TZ!!^^^^-*^^^ 
4929 (1984); Overall et al., J_in£ --— ,.^.546 

- ra:i;,rmarit'hin;^;^i.e e»ects - - 

Human IFNa8 has very high antiviral activity. 



activity. 
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M. .ffects Human IFNoT lacks NK activity and 
relatively , . „u^n IFK-« J lacKs the 

blocks NK stimulation by other INF 

^^"rrreu: rtiiato. activity i™.. 
rrerr- . ----^rr— = a. u^tea 

.he therapeutic appUcat.ons o ^^^^^^^^^^^^ 

by diverse and severe f//"^.^^^,,,, ,„,i„,i„g hallucination, 

symptoms, fatisue, leukopenia. The multiplicity 

tever, hepatic enxyme ° ^j,^ hypothesis that there may 

°' ""t "orrep::: : Tr^onfnt recepto. .or the 
be more than one receptor ^ ^^^^ ^^^^^ 268 = 12591-12595 

ira-o, family (R. Hu - ' „^ naturally occurring 

(1993)1 . Thus, the „3 a large sequence 

diversity within the human '^^l^^J™ „f the IFH-a 

.pace o. "----fie'r/ear al oprtunity^or the 
receptors and activities create 
construction of superior hybrids. 

A r-nr^-l-Y "« f!t,quCTce,jEacs _ _ 

■;i;^;;Ti;;;:;ri;;r^«ei7Bequenceso^ 

,™...s. Th! differences from — ^ ^^^^^^^^^^^^^^ 
positions Where a degenerate ~;™;-;-';;:„,„,,,o„ .be 

diversity are ^^^^-^^^/fj^^f^r^e 57 positions with two. IS 
aligned se<riences reveals '^^^ ^^^^ ^^^..^i, ,,i„o 

, positions with three an. , p sl^ ^^^^^^^^^^ 

acids encoded in this gro f ^^^^nn ot all of this 

potential diversity encoded P=™""T, ,.3 , ,0-. 

naturally occurring ^~;::;^rphisms spread over a total 

Among these hybrids, of the / p changes 

,0 of 1.5 sites in the 11 ihterferon genes IJl °J 

can be incorporated into for example, . 

J =1- fhe corresponding positions, rui. 
degenerate codons at the ^ degenerate codon 

Arg. Trp and Gly can all be encoded y 

,,,,..00. .Sing sue. aj-^^^^^^^^^^^ rUgonlcleotides . is 
35 captured with ^ J-^^^f oligonucleotides is sufficient 

evident from Tables III to VI 2 J virtually all of 

to shuffle all eleven human alpha interferons. 
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, ^- v=*tv is thereby encoded and fully permuted due to 
the natural diversity is thereoy in Table V. 

degeneracies in the nine "block- oligonucleotides 

r:::^.ructure of .™ alpha .Kontse.^ 
,» ,45 360 U994)) has been divided into nine segments based on a 
38,345-360 l"'^'' „,i„„ining secondary structure 

combination, of criteria o ^^^^^.^^ ^^.^^nt of the 

:r rbiraS: rUions^f iX";^^^^^^^ 
— - "^re Txraid^rgiyrgirtL 

oligonucleotides. Table 9 ^^^^^^ 

locations of these boundaries at '"^^ particular 

• T Th «?hould be emphasized that tnis pax 
respectively. It / ^ other segmentation 

segmentation scheme IS arbitrary ^^^^^^^ 

schemes cou d '1=° J''/^^^;^^ J^,,aaries at regions of high 

.epend ^/JfJ f^en^ers or on any particular algorithm 

identity between tne ramiiy 

for breaking the structure into segments. 

TaVile III 



5 



)0 



Segment 


Amino Acids 


Alleles 


# Permutations of all 
Sequence Variations 


1 


1-21 


5 


1024 


2 


22-51 


10 


6.2 X 10* 


3 


52-67 


6 


96 


4 


68-80 


7 


1024 


5 


81-92 


7 


192 


6 


93-115 


10 


2.5 X 10^ 


7 


116-131 


4 


8 


8 


132-138 


4 


8 


9 


139-167 


9 


9216 
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10 



15 



20 



segments, and thus there are ^^^^^^^^^ of the permutations 

each =-™ent Thus^; -^^^^^ ^ ,o^,e.itv o. .1 

Of the segment "alleles wo „^ a? ■ s x . . • • ^ 9 segment # 

X 10' (S segment ^-^-^trr — - most of the 

• :rp— rr-.er ana thus this U a good pro.Xem 
::r:::: Sr:: search the serene, space. 

9 

«.e .ethods Uv^d to construct 

shuffling (i.e. bridge "^^^-^^^^^.f the general methods 
libraries of i--^"™ '^^^^^f;/;,; or select these mutants for 
.escribed a^ve are e^P loved to scr« ^^^^^ , 

rrr—rfori^^^^^^^^^^^ 
ra^ororr bfsra:::; are de^^^^^^^ 

1 f«r a Coarse--grain-Searc h. of H ybri c 

^^^''''^^T^.ries are constructed wherein the 11 
" »; the nir-gments are permuted (note that in 
homologous forms of the nine g ^^^^ ^ ^^^^^ . 

5 many cases two homologues ^^^^en IFN alpha 

Ml nine segments are f^"^" .^^^^/^/^.^/^ ,i,,,a in Table IV. and 
,enes with the eighteen ^^^^^^^^'^^If^ oXi,c directed 
reassen^led into full length 9^- °^ ^^^^ ,,ones from the 
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reasfcu"^---- „„mK«r e q.. 1000, cxouco • 

.eco^inaticn. *n arhrtrary 7;f'^;^^:^:3,on/purif ication format, 
library are prepared in ^ "-"f^ sre screened. 

Hybrids with the most potent '"^ subjected to 

«ucleic acid is -^^J^^^J^onucLotides. These steps are 
recombination using bridge " J properties are obtained, 

repeated until candidates with desired p 
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each ot *e nine se..ents is synthesized wit. 

i::::: ::Ve:e::rer:Lein .n 

.ut --------rArtr^^^^^ .-tic 

positions where ^'f^^^ captured with a 

code, in most cases all of /aeqenerate codon will 

capture all o£ the natural i^le to capture the 

station, at a £a« postions t iS P aegenerate codon 

„«ural diversity ---jrLn^n^ur:! lutation. It is at 

which will "°"^^^,:^a set of oligonucleotides will 

,,ese positions that t,rs sec ^^^^^^^^^ ^^^^ 

differ from the first set By ^ pcR with the 18 PCE 

„,„e svnt-t- ~ Ten "4 L oligo directed 

oligonucleotides. Full leng a „3„3jected into a host, and 

recombination method are generated, transte ^ 

,ssayed for »V-;-;;- Tr frrably the top 1., are 

:::e::e^d' tnsranr;;e^;rocess repeated until a candidate with 
the desired properties is obtained. . 

F rir" u^^^^rr each 

. v,«„d one could make libraries wherein each 

is rerredTrrt — te synthe^^^^^^^^^^^^^ 

„.ich will encode --^rr^^ ^ rr.el'y search the 

in this case, the initial library J' ^^.^^ with 

space of >10» possible fine ^^^^^^^^^Mr^ POsiti.es 

Tstarr To eX "ere^-- - ^ ""^Z 

rftirfLrstu:: — - LTe^y^re^i::' 
= ::itf rera:i::irdr;e:g::n^^^^^^ -d-be recombmed at 
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each step. 

»n^r,^T^» Fin r Search 



This starting point could be either on ^^^^ 

widely therapeutically) , the cn .„£ji^d IFN-alpha's 

interferon, or a hit £ro» -"^^ 'f^/'^^f^'^.a .ake separate 

srr»;rr. r:s :r;.; - - 

all throughout the molecule. 

.he foiiowm, -u :nri uur.; :ta„aara 

.™ alpha — -nra:e1or.i„. units, very low 
error of 30-50«, inni ^ffet-ts) • reduced viral yield 

standard error (can measure . i^ibition of 

...eful for nonlethal. nonpla<^e ^J-"^ ™^ ^ „lls to 
cell growth (3H-thy.idine uptaKe £orlation by human IBF 

screening. ^^^-^^ ^ throughput formats such as 

relt^std purification in 
, .„ti-x™ antibodies or an eP Uope^te »a aff i ity^^^^^^^^^ 

purified x™ P/^J'^^fjrnts encoding the highest activities of 
format. „,,er mutagenesis, such as RSB.. and 

interest are subjected " ,,,i,ity is 

the process repeated until a desirea 



35 obtained. 
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;;;^^;;7p;rge display formats are ""^ " ^^f/f^^ 
TCK. libraries of chimerxc IFN genes are 
.,.X0.i=aUy act.v ™^ .positively or 

expressed in this iorm<i v,^T,^ina^ to one or more 

negatively) for binding ^^'^^^^ ^^f^^^^^^ 
purified IFN receptor preparations or to one 

expressing cell types. 

S^expressea by mutant, can ^ screened in high 
protein e P ^.^^ ^^.^^^ expresses GFP or 

throughput to«at on a repo ^^^^^ ^^^^^^^ 

luciferase under the contro exoression. 
such as an «HC Class I promoter d«v«g GFP expr 

, active IFN will liTnit the throughput o£ 
Purificatron of active IFH ^^^^ ^^^^^ 

the assays described above^ X " obtain ho^enous 
filamentous phage «13 wouU ^^^^^ ^,,„,,„,s or tens of 

preparations of IFN — "^^ly be handled. Ora. et aX. 
thousands of ™atants could adr V ^^^^^^^^^ , 

^■^^^•r; pr'te n' 'd i.ilar in topology to IFN alpha, can 

Tz:^s: r surface Of — -riiSt:: 

-tr-orrs.- :s:r. rrha:"ho» that hu^n 
:rrne— .actor, a - -rcorera:.:^^ 

.iologically active --/^ ^alogously, libraries 

similar to those of the solubl J ^ .y,,,,, „( 

of IFN alpha mutants on M13 can be exp the . high 

defined titre used to present biologrcally active 
defined titr _ j^^^ described herein, 

throughput assays and selectio ^he feasibility of 

The following calculation supports the 

spplying this technology to IF» alpha^ ^^^^^r^.J^'aisplayed 

-;:r ar rtrthT d:;i::: -aientiy 
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.etive to soluble .eco-inan. ^^^^^J:::^ can 
potent due to .ulti -valency) the ^--^^ ^^^^ 
reasonably expect to see biologxcal act.vxty. 

10 V, ^/n,!^ X (5 IFN molecules/phage) X (1 
mole/6xl023 molecules) x I2b,uu a 



a 2.2 ng/tnl 



10 



15 



20 
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1 ng/na for NK activation, 0 1 ^ ^^^^^ ^^^^^ .^zes at 

activity on Eskol cells, ana . ^^^^ ^ ^though some subtypes, 

,1., 2. Tn^^rfPron ..^^^ consensus interjeron are 

are glycosylated, interferon alpha2a an ^^^^^ ^^^^^ 

e^ressea in active Z^^, x™ alpha 

t„c do not require glycosylation ^ biologically active 

e^cpressed on filamentous ^^J^J^ ^^,,/,,,i,„. libraries of in. 
as phage lysates without ^''f''^'^^" scored in the 

Chimeras are expressed ^^ ^^^'-^X mutants with in^roved 
3.eays described ^"-^ ,„„„a3 of RSR. 

properties to be put .'""f activate one cell due to 

mien one phage is sufficient ,^ 
..e high valency state of the displaye « " ,1^,., 
tne gene XXI format; hundreds per p a.e ^^^^^^ ^ ^^ed 

.ens in the ^^^^.T^JJ^^ 'JZ^Jc^^^ with a OPP 
directly at suitable dilution responsive 
reporter construct under tne - -^^^ .,„r 
promoter. Assuming '^^/^ification of the responsive 

stimulation, expression a"^ J«= ^ ^.^^ „ith improved 

C.XXS, one ^-^^ ^Ti ^a-- ^ "^""^ 
I activity from very large no 

IQ-' phage per FACS run) . advantage in this 

. second way xn wh h p.cs ^^^^^^^^^^ ^ 

format is the following. Cells c ^^^^^ construct, 

format with one lysate ^ ^ ^llect the brightest 

5 All stimulated cells are FACS purxfx ^^^^ 

— irorthrprft::: —^^^^^^ - — 
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V, „ in this protocol the etimuUtion i. performed with 
is obtained. In this pro requirement that a 

individual concentrated ^V-^s and hence the 

Single phaee be ^^;^:'::titls. cells which, in 

Furthermore, one can gate to ^^H/^' ^^^^^ ^hem. 

turn, should have the most potent phage attached 

i™ alpha ™.tants. fj/trp ^teln is -^P-^^^ ^ 
, advantages over phage display. First, P ^ 
a euHarvotic cell and hence can he expressed in^ prop V 
glycosylated form which may '^^^J^"^^^^ ^^^^.r. valency 

5 weakly active mutants. , ^»r,^ IFN' s is constructed 

Tn >^rief a library of mutant. IFN s is _ 

tail has been {used to the carooxyi , rv,^. Med. 

protein for surface expression ^'^'^^'^f 

',4,219-.8 The library is ^"^ ^.^rotiter 

^-:rt:vrre'rt::::rwirhr^^^^^^^^^^^ 

format. Positives^are ^^^^^^ and 

rernsformirn :rrrt or by .. and are subjected to 
25 for further evolution. 

30 generated m a vecco efficient secretion. The 

a.e. metallothionein promoter, and e icien ^^^^^^^^ 

recipient cell line carrying ^"'J^^'^c.ion with the mutant IPN 
lOFP or luciferase) is induced responsive propter 

constructs. Mutants which stimulate the IFH r p 

35 are detected by by F^CS ^r^^;^ transfectants 

variation on this format is 
with virus and select for survivors. One cou 



h €?et of transf ectants 
of Viral challenge and of UllUng and 

prior tc retrieving the genes ^ small advantage 

::ra:''arararge in^etecting s»U i.--nta in 
5 viral killing. 

Table_l2 

A^A forbl2ckwise.^econ^i^^ 

" •-ACAO.TTCTLG/TlClC/T/Ci™ 

5 . -CAGTTCCAGAAG lA/G] CT [G/C] [C/Al AGCCATC 

, MtggctIt/gUc/cIagIt/cIcttctggaactg 

5. .cttcaatctcttca[g/c] caca 
c. tgtgig/c]tgaagagattgaag 

.rMr/L.c/mGAec/Git^^^^^^^ 

I . TCTAGGAG IG/Cl [G/C] TCT [G/C] IT/A] TCC 

.:rCTTtT/G/AHT/AKCAGC^^VC^^^^^ 
5 . - ATTCA [T/Gl TTGCTGG [A/Tl [A/T/CI AAGTTC 
5.-GGACTIT/CICATCCTGGCTGTG 
5..CACAGCCAGGATGtG/AlAGTCC 

^^^K rnr*T 
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3. 5' 

4. 5' 

5. 



7. 5 

8. 5 
9. 

10 

11. 5 

12 

13. - 

^^^TvTvn'PTvrTCTTTATCT 

15. 5 



5.-AAGAATCACTCTTTAT 
5.-AGATAAAGAGTGATTCTT 
5 . -TGGGAGGTTGTCAGAGCAG 
CTGCTCTGACAACCTCCCA 

r ^ y 1 r^n 



16. 

5. -TCACA/TITCCTTICMICTCIT/CITTM 
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.raceta indicate -^-"^ J ^^fst of fh: degeneracy 
specified basea at thoae P°"^--^ ..^rs of the 

to allow this °- ° choice of the oligo 

IFB family with similar because they wiU get 
,,,.en «co„.ination points r .^-n ^^^^^^^^ 

• overwritten, rn -h cy^l ^^^^ selection, 
with the rest of the sequ 



100 
Tabl__V 



A^A for H| 1^ I II lin" rr— "^^^"'^^io"- 



Block 

1 
2 
3 
4 
5 

6 

7 

8 

9 



ttLength of oligo required 



76 

95 

65 

56 

51 

93 

50 

62 

80 



TaM e VI 



Acid, 

w 

Y 
F 



Atnino.aciJs.^^ 
mnt-ation 
C, R, G. L 
F, S. C, H, N, D 
L, I, V, S. Y, C 
S, W, F, I. M, V, P 
^ F. L, I. M. A. D, E, G 

F, 

S, P, T. V. D. E, G 
V. A, D, E, R. S, C, W 
L. I, V. T, K. R 

F. L. Y, C. W. P. T, A. R. G, N, 



^ L. M, V. T. N. K. S. R 

I 
A 
G 
M 
S 
I 
T 
P 



S, P, A, I, M, N, K, S, 

S, T, A, L, H, Q, R 

F, S, Y, R, G. W 

^ Y, H, K, D, S. T. I 

N 
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Y, H, K, E, L, P. R 

^ Y, Q, N, D, L, P, R 

Y. H, N, E, V, A. G 



H 



I Q. K, D, V, A. G 

L. P. H. Q. C, n, S,.0, K, T. I. M 

^ Q, N. E. R. T. I, M 

K 

. .his Table, the polymorphic positions in IFN 
Z the dlfers t,.can be captured by a degenerate 
alpha Where all o^^^j/^^ oligonucleotides of the length 
rrcareriXr re--- L degenerates inferred fro. 

TltrrrUoing invention has ^^1^ 
illustration and example for purp^ 
some detail by way ^^/^^^f/^^^^ that certain changes 

Clarity of ^^^^^^^^^^l^^^^^ the scope of the 

and modifications may be practice 

appended claims. expressly incorporated 

All references cited herein ctx 
in their entirety for all purposes. 



